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to my name; I believe that I am the original, first and sole inventor (if only one name is listed below) or an original, first and joint 
inventor (if plural names are listed below) of the subject matter which is claimed and for which a patent is sought on the invention entitled 
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as PCT International Application No. on and was amended under Article 19 on 
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as amended by any amendment(s) referred to above. I acknowledge the duty to disclose to the Patent and Trademark Office all 
information known to me to be material to patentability as defmed in 37 C.F.R. §1.56. 

I hereby claim foreign priority benefits under 35 U.S.C. §119 of any foreign application(s) for patent or inventor's certificate 
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at least one country other than the United States of America filed by me on the same subject matter having a filing date before that of 
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disclosed in the prior application(s) in the manner provided by the first paragraph of 35 U.S.C. §11 2, 1 acknowledge the duty to disclose 
to the Office all mformation known to me to be material to patentability as defined in37C.F.R.§1.56 which occurred between the filing 
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09/624,693 24/07/2000 Pending 

(Application Serial Number) (Day/Month/Year Filed) (Status-Patented, Pending or Abandoned) 

I hereby declare that all statements made herein of my own knowledge are true and that all statements made on information and 
belief are believed to be true; and further that these statements were made with the knowledge that willful false statements and the like 
so made are punishable by fine or imprisonment, or both, under 18 U.S.C. §1001 and that such willful false statements may jeopardize 
the validity of the application or any patent issued thereon. 
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IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 
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Yoshida et al. 
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For: Phenylalanine Ammonia Lyase 
Polypeptide and Polynucleotide 
Sequences and Methods of Obtaining 
and Using Same 
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Examiner: unassigned 
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States Postal Service under Maihng 
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DECLARATION OF BIOLOGICAL DEPOSIT 
IN COMPLIANCE WITH THE BUDAPEST TREATY 



Commissioner for Patents 
Washington, D.C. 20231 

Dear Sir or Madam: 

Audrey L.Bartnicki, hereby states as follows: 



1. I am an attorney of record for the above-identified patent application, and 
as such I am authorized to act on behalf of PCBU SERVICES, INC., the assignee of the 
application, having its principal place of business at 300 Delaware Avenue, 9th Floor- 
5403, Wilmington, DE 19801. 

2. PCBU SERVICES, INC. is the assignee of the above-identified patent 
application as evidenced by an assignment dated August 3, 2001, from the co-inventors, 
Roberta K. Yoshida and Aima B. Kootstra, which was filed herewith for recordation in the 
U.S. Patent and Trademark Office. 
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3. The following strain, described in the specification of the above-identified 
application at page 68, lines 26-29, was deposited with the American Type Culture 
Collection, 10801 University Boulevard, Manassas, VA 20110-2209 under the terms of the 
Budapest Treaty: 



Original Strain Name 


ATCC Strain Name 


Date of Deposit 


RY624 (i.e., plasmid pY141 
introduced into E. coli XLl- 
Blue) 


PTA-2224 


July 12, 2000 



A copy of the "Receipt in the Case of an Original Deposit Issued Pursuant to Rule 7.3 and 
Viability Statement Issued Pursuant to Rule 10.2" is appended. 



4. The American Type Culture Collection is a depository in accordance with 
the Budapest Treaty for the above-deposited cultures. Should the cells mutate, become 
non-viable, non-functional, or be inadvertently destroyed, the assignee will replace such 
cells for at least thirty years from the date of the original deposit, or for at least five years 
from the date of the most recent request for release of a sample, or for the enforceable life 
of any patent issued on the above-identified application, whichever period is longest. 

5. The deposit has been made under conditions of assurance of (a) ready 
accessibility thereto by the public if an enforceable patent is granted whereby all restrictions 
to the availability to the public of the cell lines so deposited will be irrevocably removed 
upon the granting of the patent, and (b) access to the cell lines will be available during 
pendency of the patent application to one determined by the Conraiissioner for Patents to be 
entitled thereto tmder applicable statotes and regulations. 

6. All statements made herein of my own knowledge are true and all 
statements made on information and belief are believed to be true; and further that these 
statements were made with the knowledge that willful false statements and the like so made 
are punishable by fine or imprisonment, or both under section 1001 of Title 18 of the 
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United States Code, and that such willful false statements may jeopardize the validity of 
application or any patent issued thereon. 



MARSHALL, GERSTEIN & BORUN 

6300 Sears Tower 

233 South Wacker Drive 

Chicago, Illinois 60606-6402 

Telephone: (312) 474-6300 



MARSHALL, GERSTEIN & BORUN 



August 24, 2001 



By 
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UNITED STATES PATENT AND TRADEMARK OFFICE 
DOCUMENT CLASSIFICATION BARCODE SHEET 



Sequence Listing 



SEQUENCE LISTING 



<110> Yoshida, Roberta 
Kootstra, Anna 

<120> Phenylalanine Ammonia Lyase Polypeptide and 

Polynucleotide Sequences and Methods of Obtaining and 
Using Same 

<130> 29479/500NSCA 

<150> US 09/624, 693 
<151> 2000-07-24 

<150> PCT/USOl/23270 
<151> 2001-07-24 

<160> 30 

<170> Patentin Ver. 2.0 

<210> 1 
<211> 35 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic 
primer OLI 61 

<400> 1 

gacggatcca ctatggcbcc btcsgtsgac tcgat 35 



<210> 2 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<221> modified_base 
<222> (13) 

<223> n = (a or c or g or t/u) or (unknown or other) 

<223> Description of Artificial Sequence: Synthetic 
Primer OLI 63 

<400> 2 

gacgaattct tangccatca tcttsacsag gac 33 
<210> 3 
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<211> 36 
<212> DNA 

<213> Artificial Sequence 
<220> 

<221> modif ied_base 

<222> (24, 25, 29, 30, 34, 35) 

<223> n = i or inosine 

<223> Description of Artificial Sequence: Synthetic 
Primer AAP 

<400> 3 

ggccacgcgt cgactagtac gggnngggnn gggnng 

<210> 4 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
Primer GSP2 

<400> 4 

cgcgaattca gaatgccctc gtcgtccttg acc 



<210> 5 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
Primer AUAP 

<400> 5 

ggccacgcgt cgactagtac 



<210> 6 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
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Primer GSP4 
<400> 6 

ccggaattcc gacgagccgg aaaggagcgt gcg 



<210> 7 
<211> 37 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
Primer AP 

<400> 7 

ggccacgcgt cgactagtac tttttttttt ttttttt 37 



<210> 8 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
Primer GSP5 

<400> 8 

ggtcaaggac gacgagggca ttct 24 



<210> 9 
<211> 38 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 

Primer GSP6 
<400> 9 

ccgggatcca tgatgcacgc ctactcgact ctctcgct 38 



<210> 10 
<211> 36 
<212> DNA 
<213> Artific 
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<220> 

<223> Description of Artificial Sequence: Synthetic 
Primer OLI 77 

<400> 10 

atcgaattcc actctaaccc gtcactagca ctcgcg 



<210> 11 
<211> 36 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
Primer OLI 78 

<400> 11 

atcggatccc acgacacgac gtcgaaaagc tggtct 



<210> 12 
<211> 2419 
<212> DNA 

<213> Rhodotorula graminis 

<220> 

<221> CDS 

<222> (37) . . (2196) 



<220> 

<221> modif ied_base 
<222> (494) 

<223> Other information: y = t or c 



<220> 

<221> unsure 

<222> (493)... (495) 

<223> Other information: Xaa = Val or Ala 



<400> 12 

ctcctgcctc actctaaccc gtcactagca ctcgcg atg gcc cct tec ttg gac 

Met Ala Pro Ser Leu Asp 
1 5 
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teg etc gcc acc acg etc gcc aac ggc ttt acc aac ggc teg cac gee 
Ser Leu Ala Thr Thr Leu Ala Asn Gly Phe Thr Asn Gly Ser His Ala 
10 15 20 

get ccg ace aag teg get gcg gge ccc act teg get etc cge cgc acg 
Ala Pro Thr Lys Ser Ala Ala Gly Pro Thr Ser Ala Leu Arg Arg Thr 
25 30 35 

cce ggc etc gat ggc cac gcc gcg cac eag teg cag etc gag ate gtg 
Pro Gly Leu Asp Gly His Ala Ala His Gin Ser Gin Leu Glu He Val 
40 45 50 

eag gag etc etc age gae ccc acc gac gac gtc gtc gag etc age ggg 
Gin Glu Leu Leu Ser Asp Pro Thr Asp Asp Val Val Glu Leu Ser Gly 



tac age etc acc gtc cgt gac gtt gtc ggc gee gee cgc aag ggg cge 
Tyr Ser Leu Thr Val Arg Asp Val Val Gly Ala Ala Arg Lys Gly Arg 



agg gtc cgc gtc cag aac gac gac gag ate cgc gca cgc gtc gae aag 
Arg Val Arg Val Gin Asn Asp Asp Glu He Arg Ala Arg Val Asp Lys 
90 95 100 

age gte gac ttc etc aag gcc cag ctt cag aac teg gtc tac gga gtc 
Ser Val Asp Phe Leu Lys Ala Gin Leu Gin Asn Ser Val Tyr Gly Val 
105 110 115 

acc acg ggt ttc ggt ggc teg gcc gac acg agg act gag gat gca gtc 
Thr Thr Gly Phe Gly Gly Ser Ala Asp Thr Arg Thr Glu Asp Ala Val 
120 125 130 

age etc cag aag gcg etc ate gag cac cag etc tgc gge gtg acg ccg 
Ser Leu Gin Lys Ala Leu He Glu His Gin Leu Cys Gly Val Thr Pro 
135 140 145 150 

acg tee gyc teg tec ttc age gtc gga cgc ggc etc gag aac acg ctt 
Thr Ser Xaa Ser Ser Phe Ser Val Gly Arg Gly Leu Glu Asn Thr Leu 
155 160 165 

ccg etc gag gte gtc cge ggc gcc atg gte ate cgc gtc aac teg etc 
Pro Leu Glu Val Val Arg Gly Ala Met Val He Arg Val Asn Ser Leu 
170 175 180 

acg cgt ggc cac teg gcc gtc cgc etc gtc gtc ctt gag gcg etc acc 
Thr Arg Gly His Ser Ala Val Arg Leu Val Val Leu Glu Ala Leu Thr 
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aac ttc ttg 
Asn Phe Leu 
200 

ate teg gcg 
He Ser Ala 
215 

ate acc ggt 
He Thr Gly 



gag aag ate 
Glu Lys He 



gea gtc gte 
Ala Val Val 
265 

gee gtc tec 
Ala Val Ser 
280 

etc teg etc 
Leu Ser Leu 
295 

gtc ggc cag 
Val Gly Gin 



ccg cac cce 
Pro His Pro 

tec ggc teg 
Ser Gly Ser 
345 

gac gac gag 
Asp Asp Glu 
360 

cct cag ttc 
Pro Gin Phe 
375 



aac cac ege 
Asn His Arg 



teg ggc gac 
Ser Gly Asp 
220 

cac ecc gac 
His Pro Asp 
235 

atg ttt gcg 
Met Phe Ala 
250 

etc ggc ccg 
Leu Gly Pro 



gee teg atg 
Ala Ser Met 



etc teg cag 
Leu Ser Gin 
300 

cag ggc tog 
Gin Gly Ser 
315 

ggc cag gte 
Gly Gin Val 
330 

teg ttt gee 
Ser Phe Ala 



ggc att ctt 
Gly He Leu 



etc ggc ccg 
Leu Gly Pro 
380 



ate aeg cce 
He Thr Pro 
205 

etc age ccg 
Leu Ser Pro 



gtc aag gtt 
Val Lys Val 



cgc gag gee 
Arg Glu Ala 
255 

aag gag ggt 
Lys Glu Gly 
270 

gcg acc etc 
Ala Thr Leu 
285 

gee ttg aeg 
Ala Leu Thr 



ttc gcg ccg 
Phe Ala Pro 



gag gtc gcg 
Glu Val Ala 
335 

gtt gag cac 
Val Glu His 
350 

cgc cag gac 
Arg Gin Asp 
365 

etc gtg gag 
Leu Val Glu 



ate gtc cce 
He Val Pro 
210 

etc teg tac 
Leu Ser Tyr 
225 

cac gtt ttg 
His Val Leu 
240 

ate teg etc 
He Ser Leu 



etc ggt ctg 
Leu Gly Leu 



agt ctg cac 
Ser Leu His 
290 

get etc aeg 
Ala Leu Thr 
305 

ttc ate cac 
Phe He His 
320 

cgc aac ate 
Arg Asn He 

gag gag gag 
Glu Glu Glu 



cgc tac ccg 
Arg Tyr Pro 
370 

gac atg atg 
Asp Met Met 
385 



etc cgc ggc 
Leu Arg Gly 



ate gee ggc 
He Ala Gly 



cac gag gga 
His Glu Gly 
245 

ttt ggt etc 
Phe Gly Leu 
260 

gte aac gga 
Val Asn Gly 
275 

gac teg cac 
Asp Ser His 



gtg gag gee 
Val Glu Ala 



gac gtc tgc 
Asp Val Cys 
325 

cgc aeg etc 
Arg Thr Leu 
340 

gte aag gte 
Val Lys Val 
355 

etc cgc aeg 
Leu Arg Thr 



cac gee tac 
His Ala Tyr 



tee 678 
Ser 



gee 726 

Ala 

230 

acc 774 
Thr 



gag 822 
Glu 



aeg 870 
Thr 



atg 918 
Met 



atg 966 

Met 

310 

cgc 1014 
Arg 



ctt 1062 
Leu 



aag 1110 
Lys 



teg 1158 
Ser 



teg 1206 

Ser 

390 
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act etc teg etc gag aac aac aeg acg acc gae aac ccg etc etc gac 
Thr Leu Ser Leu Glu Asn Asn Thr Thr Thr Asp Asn Pro Leu Leu Asp 
395 400 405 

gtc gag aac aag cag acc gcg cac gge ggc aac tte eag gcg teg get 
Val Glu Asn Lys Gin Thr Ala His Gly Gly Asn Phe Gin Ala Ser Ala 
410 415 420 

gtc teg att teg atg gag aag acc agg etc gea etc gee etc ate ggc 
Val Ser lie Ser Met Glu Lys Thr Arg Leu Ala Leu Ala Leu He Gly 
425 430 435 

aag etc aac ttc acg cag tgc acc gag ttg etc aac get gee atg aac 
Lys Leu Asn Phe Thr Gin Cys Thr Glu Leu Leu Asn Ala Ala Met Asn 
440 445 450 

cgc ggc ctg cct teg tgc etc get gee gag gac ccg teg etc aac tat 
Arg Gly Leu Pro Ser Cys Leu Ala Ala Glu Asp Pro Ser Leu Asn Tyr 
455 460 465 470 

cac ggc aag ggc ttg gac att cac ate get get tac get teg gag etc 
His Gly Lys Gly Leu Asp He His He Ala Ala Tyr Ala Ser Glu Leu 
475 480 485 

gge cac ctt gee aac ccg gtc act acc ttc gtc cag ccc gea gag atg 
Gly His Leu Ala Asn Pro Val Thr Thr Phe Val Gin Pro Ala Glu Met 
490 495 500 

ggc aac cag gee gtc aac teg etc get etc ate tec gcg cgc cgc act 
Gly Asn Gin Ala Val Asn Ser Leu Ala Leu He Ser Ala Arg Arg Thr 
505 510 515 

gcc gag gcc aac gac gtc ctt tct etc ctt etc gee teg cac ctg tac 
Ala Glu Ala Asn Asp Val Leu Ser Leu Leu Leu Ala Ser His Leu Tyr 
520 525 530 

tgc acg etc cag gcc gtc gac etc cgc gcg atg gag etc gac ttc aag 
Cys Thr Leu Gin Ala Val Asp Leu Arg Ala Met Glu Leu Asp Phe Lys 
535 540 545 550 

aag cag ttc gac ccg ctt etc ccg act etc etc cag cag cac etc ggc 
Lys Gin Phe Asp Pro Leu Leu Pro Thr Leu Leu Gin Gin His Leu Gly 
555 560 565 

act ggc etc gac gtc aac gea ctt gcg etc gag gtc aag aag gcg etc 
Thr Gly Leu Asp Val Asn Ala Leu Ala Leu Glu Val Lys Lys Ala Leu 
570 575 580 
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aac aag cgt etc gag cag acg acg acg tac gac etc gag ccg cgc tgg 
Asn Lys Arg Leu Glu Gin Thr Thr Thr Tyr Asp Leu Glu Pro Arg Trp 
585 590 595 



1830 



cac gac gee ttc teg tac gcg acc ggc acc gtc gtc gag etc etc teg 1878 
His Asp Ala Phe Ser Tyr Ala Thr Gly Thr Val Val Glu Leu Leu Ser 
600 605 610 

tec teg cec tct gee aac gtc acc ctt act gee gtc aac gcg tgg aag 1926 
Ser Ser Pro Ser Ala Asn Val Thr Leu Thr Ala Val Asn Ala Trp Lys 
615 620 625 630 

gtt gcc teg gee gag aag gee ate teg etc acg cgc gag gtg ege aac 1974 
Val Ala Ser Ala Glu Lys Ala lie Ser Leu Thr Arg Glu Val Arg Asn 
635 640 645 

cgc ttc tgg cag acg ccg tct teg cag gcg ccg gcg cac gca tac etc 2022 
Arg Phe Trp Gin Thr Pro Ser Ser Gin Ala Pro Ala His Ala Tyr Leu 
650 655 660 

teg ccg cgc acg cgc gtc etg tac teg ttc gtg cgc gag gag etc ggc 2070 
Ser Pro Arg Thr Arg Val Leu Tyr Ser Phe Val Arg Glu Glu Leu Gly 
665 670 675 

gtg cag gcg cgc ege gge gac gtg ttt gtc ggc gtg cag cag gag aeg 2118 
Val Gin Ala Arg Arg Gly Asp Val Phe Val Gly Val Gin Gin Glu Thr 
680 685 690 

ate ggg age aac gte teg ege ate tac gag gcc ate aag gac ggc ege 2166 
lie Gly Ser Asn Val Ser Arg lie Tyr Glu Ala lie Lys Asp Gly Arg 
695 700 705 710 

ate aac cac gtc etc gtc aag atg etc gcg taaggcccga gcaagecteg 2216 
lie Asn His Val Leu Val Lys Met Leu Ala 
715 720 

cetagaegee egcctcaccc caagaccagc ttttcgacgt cgtgtcgtgc caagaacgga 227 6 

etttectcca tacacatgtc gccttactet etcgeegtca tcacgtetct cagttctttc 2336 

gtateecgeg tetcteggtc gtcagtacae gtgtatagag cctggaatgg attgcaagtc 2396 

ttcgagttca aaaaaaaaaa aaa 2419 



<210> 13 
<211> 720 
<212> PRT 

<213> Rhodotorula graminis 
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<220> 



<221> unsure 
<222> (153) 

<223> Other information: Xaa = Val or Ala 



<400> 13 

Met Ala Pro Ser Leu Asp Ser Leu Ala Thr Thr Leu Ala Asn Gly Phe 
15 10 15 

Thr Asn Gly Ser His Ala Ala Pro Thr Lys Ser Ala Ala Gly Pro Thr 
20 25 30 

Ser Ala Leu Arg Arg Thr Pro Gly Leu Asp Gly His Ala Ala His Gin 
35 40 45 

Ser Gin Leu Glu lie Val Gin Glu Leu Leu Ser Asp Pro Thr Asp Asp 
50 55 60 

Val Val Glu Leu Ser Gly Tyr Ser Leu Thr Val Arg Asp Val Val Gly 



Ala Ala Arg Lys Gly Arg Arg Val Arg Val Gin Asn Asp Asp Glu lie 
85 90 95 

Arg Ala Arg Val Asp Lys Ser Val Asp Phe Leu Lys Ala Gin Leu Gin 
100 105 110 

Asn Ser Val Tyr Gly Val Thr Thr Gly Phe Gly Gly Ser Ala Asp Thr 
115 120 125 

Arg Thr Glu Asp Ala Val Ser Leu Gin Lys Ala Leu lie Glu His Gin 
130 135 140 

Leu Cys Gly Val Thr Pro Thr Ser Xaa Ser Ser Phe Ser Val Gly Arg 
145 150 155 160 

Gly Leu Glu Asn Thr Leu Pro Leu Glu Val Val Arg Gly Ala Met Val 
165 170 175 

lie Arg Val Asn Ser Leu Thr Arg Gly His Ser Ala Val Arg Leu Val 
180 185 190 

Val Leu Glu Ala Leu Thr Asn Phe Leu Asn His Arg lie Thr Pro lie 
195 200 205 

Val Pro Leu Arg Gly Ser lie Ser Ala Ser Gly Asp Leu Ser Pro Leu 
210 215 220 
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Ser Tyr lie Ala Gly Ala lie Thr Gly His Pro Asp Val Lys Val His 
225 230 235 240 



Val Leu His Glu Gly Thr Glu Lys He Met Phe Ala Arg Glu Ala He 
245 250 255 

Ser Leu Phe Gly Leu Glu Ala Val Val Leu Gly Pro Lys Glu Gly Leu 
260 265 270 

Gly Leu Val Asn Gly Thr Ala Val Ser Ala Ser Met Ala Thr Leu Ser 
275 280 285 

Leu His Asp Ser His Met Leu Ser Leu Leu Ser Gin Ala Leu Thr Ala 
290 295 300 

Leu Thr Val Glu Ala Met Val Gly Gin Gin Gly Ser Phe Ala Pro Phe 
305 310 315 320 

He His Asp Val Cys Arg Pro His Pro Gly Gin Val Glu Val Ala Arg 
325 330 335 

Asn He Arg Thr Leu Leu Ser Gly Ser Ser Phe Ala Val Glu His Glu 
340 345 350 

Glu Glu Val Lys Val Lys Asp Asp Glu Gly He Leu Arg Gin Asp Arg 
355 360 365 

Tyr Pro Leu Arg Thr Ser Pro Gin Phe Leu Gly Pro Leu Val Glu Asp 
370 375 380 

Met Met His Ala Tyr Ser Thr Leu Ser Leu Glu Asn Asn Thr Thr Thr 
385 390 395 400 

Asp Asn Pro Leu Leu Asp Val Glu Asn Lys Gin Thr Ala His Gly Gly 
405 410 415 

Asn Phe Gin Ala Ser Ala Val Ser He Ser Met Glu Lys Thr Arg Leu 
420 425 430 

Ala Leu Ala Leu He Gly Lys Leu Asn Phe Thr Gin Cys Thr Glu Leu 
435 440 445 

Leu Asn Ala Ala Met Asn Arg Gly Leu Pro Ser Cys Leu Ala Ala Glu 
450 455 460 

Asp Pro Ser Leu Asn Tyr His Gly Lys Gly Leu Asp He His He Ala 
465 470 475 480 
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Ala Tyr Ala Ser Glu Leu Gly His Leu Ala Asn Pro Val Thr Thr Phe 
485 490 495 



Val Gin Pro Ala Glu Met Gly Asn Gin Ala Val Asn Ser Leu Ala Leu 
500 505 510 

lie Ser Ala Arg Arg Thr Ala Glu Ala Asn Asp Val Leu Ser Leu Leu 
515 520 525 

Leu Ala Ser His Leu Tyr Cys Thr Leu Gin Ala Val Asp Leu Arg Ala 
530 535 540 

Met Glu Leu Asp Phe Lys Lys Gin Phe Asp Pro Leu Leu Pro Thr Leu 
545 550 555 560 

Leu Gin Gin His Leu Gly Thr Gly Leu Asp Val Asn Ala Leu Ala Leu 
565 570 575 

Glu Val Lys Lys Ala Leu Asn Lys Arg Leu Glu Gin Thr Thr Thr Tyr 
580 585 590 

Asp Leu Glu Pro Arg Trp His Asp Ala Phe Ser Tyr Ala Thr Gly Thr 
595 600 605 

Val Val Glu Leu Leu Ser Ser Ser Pro Ser Ala Asn Val Thr Leu Thr 
610 615 620 

Ala Val Asn Ala Trp Lys Val Ala Ser Ala Glu Lys Ala lie Ser Leu 
625 630 635 640 

Thr Arg Glu Val Arg Asn Arg Phe Trp Gin Thr Pro Ser Ser Gin Ala 
645 650 655 

Pro Ala His Ala Tyr Leu Ser Pro Arg Thr Arg Val Leu Tyr Ser Phe 
660 665 670 

Val Arg Glu Glu Leu Gly Val Gin Ala Arg Arg Gly Asp Val Phe Val 
675 680 685 

Gly Val Gin Gin Glu Thr He Gly Ser Asn Val Ser Arg He Tyr Glu 
690 695 700 

Ala He Lys Asp Gly Arg He Asn His Val Leu Val Lys Met Leu Ala 
705 710 715 720 



<210> 14 
<211> 2311 



-11- 



<212> DNA 

<213> Amanita muscaria 



<220> 

<221> CDS 

<222> (18) . . (2237) 

<400> 14 

gtcgctcgca aatctaa atg ggt etc gat aac tec aag aac act gee aaa 
Met Gly Leu Asp Asn Ser Lys Asn Thr Ala Lys 
15 10 

ttt ttt gac eta cca aaa gcc gtc cat ggt atg aat ggt aca acc ccc 
Phe Phe Asp Leu Pro Lys Ala Val His Gly Met Asn Gly Thr Thr Pro 



gtc aat ggt ttt aaa gcg aca gcg ctt tec aag gcc tec cga aca atg 
Val Asn Gly Phe Lys Ala Thr Ala Leu Ser Lys Ala Ser Arg Thr Met 
30 35 40 

acc aag act age gca etc teg caa ttc tta gaa gcg tac egt gaa etc 
Thr Lys Thr Ser Ala Leu Ser Gin Phe Leu Glu Ala Tyr Arg Glu Leu 



gag ggc tac aag aat ggt aga gcc ate aag gtt gac ggt caa acg tta 
Glu Gly Tyr Lys Asn Gly Arg Ala lie Lys Val Asp Gly Gin Thr Leu 



tct att gca gcc gtc get gca get get cgc tac aat gcg gcc gtt gag 
Ser lie Ala Ala Val Ala Ala Ala Ala Arg Tyr Asn Ala Ala Val Glu 



ttg gac gaa tec cca ctt gtt aag gag cgc gtg agg aaa agt cag ctt 

Leu Asp Glu Ser Pro Leu Val Lys Glu Arg Val Arg Lys Ser Gin Leu 

95 100 105 

get ate gca aac aaa gta teg acc ggt gee age gta tae gga ctg tea 

Ala He Ala Asn Lys Val Ser Thr Gly Ala Ser Val Tyr Gly Leu Ser 

110 115 120 

act ggt tte ggt ggc agt get gat aca egg acg gac aaa ccg atg ttg 

Thr Gly Phe Gly Gly Ser Ala Asp Thr Arg Thr Asp Lys Pro Met Leu 

125 130 135 

ttg ggg ttt gee ctt ttg caa cac caa cat gta ggg ata ctg ccc acc 

Leu Gly Phe Ala Leu Leu Gin His Gin His Val Gly He Leu Pro Thr 

140 145 150 155 
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teg act gag cct ttg gac gtc eta cct etc caa gat gca aat aac aca 
Ser Thr Glu Pro Leu Asp Val Leu Pro Leu Gin Asp Ala Asn Asn Thr 
160 165 170 

age atg cea gag gcg tgg att cgc ggg gee att ttg ate egt atg aat 
Ser Met Pro Glu Ala Trp lie Arg Gly Ala lie Leu lie Arg Met Asn 
175 180 185 

teg eta att cgt ggc eac tct gga ate aga tgg gag ttg ate gaa aag 
Ser Leu lie Arg Gly His Ser Gly lie Arg Trp Glu Leu lie Glu Lys 
190 195 200 

atg aga gaa eta etc gcg gee aat gtg ata ect gtc gtt ecc etg aga 
Met Arg Glu Leu Leu Ala Ala Asn Val lie Pro Val Val Pro Leu Arg 
205 210 215 

ggc age ate tec tea tec gga gat etg tct ecc eta tee tat ate gca 
Gly Ser lie Ser Ser Ser Gly Asp Leu Ser Pro Leu Ser Tyr lie Ala 
220 225 230 235 

ggc acg att att ggc aac cca tea ate aag gta tat eac ggt cea tea 
Gly Thr lie lie Gly Asn Pro Ser lie Lys Val Tyr His Gly Pro Ser 
240 245 250 

aag tec gga att cge caa att gga tec teg aag gat gtc ttg get etg 
Lys Ser Gly lie Arg Gin lie Gly Ser Ser Lys Asp Val Leu Ala Leu 
255 260 265 

cat aat ate gaa cct ttc cca etg gaa teg aaa gaa cct ett ggt att 
His Asn lie Glu Pro Phe Pro Leu Glu Ser Lys Glu Pro Leu Gly lie 
270 275 280 

ttg aat ggg ace gca ttc teg gca tct gtg gca get tta gcc eta aac 
Leu Asn Gly Thr Ala Phe Ser Ala Ser Val Ala Ala Leu Ala Leu Asn 
285 290 295 

gaa got ate cat ett gtc ttg ttg get caa gtg tge acg get atg ggg 
Glu Ala He His Leu Val Leu Leu Ala Gin Val Cys Thr Ala Met Gly 
300 305 310 315 

aec gag gca ttg ata ggc act cgc get tet eat gca ecg ttc att eat 
Thr Glu Ala Leu lie Gly Thr Arg Ala Ser His Ala Pro Phe lie His 
320 325 330 

gcc aec gca cga cea eat cec ggt caa gta gaa tgt get gag aac att 
Ala Thr Ala Arg Pro His Pro Gly Gin Val Glu Cys Ala Glu Asn He 
335 340 345 
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tgg aat ttg etc gat ggg agt aaa ttg get cag tta gaa gag cac gaa 
Trp Asn Leu Leu Asp Gly Ser Lys Leu Ala Gin Leu Glu Glu His Glu 
350 355 360 

gtt cgc eta gaa gac gat aaa tac acc ctt egg cag gac cgt tat cca 
Val Arg Leu Glu Asp Asp Lys Tyr Thr Leu Arg Gin Asp Arg Tyr Pro 
365 370 375 

etc cga act teg ect caa ttc ctt ggg ect cag att gaa gac ata ate 
Leu Arg Thr Ser Pro Gin Phe Leu Gly Pro Gin lie Glu Asp lie lie 
380 385 390 395 

tec get ttc eag act gta aeg cag gag tgt aat tac tta cca get act 
Ser Ala Phe Gin Thr Val Thr Gin Glu Cys Asn Tyr Leu Pro Ala Thr 
400 405 410 

gae aat eca ctg att gat ggg gag act ggc gaa tct eae cac ggt ggc 
Asp Asn Pro Leu lie Asp Gly Glu Thr Gly Glu Ser His His Gly Gly 
415 420 425 

aat tte caa geg atg get gta act aat gea atg gag aag acg cga ctt 
Asn Phe Gin Ala Met Ala Val Thr Asn Ala Met Glu Lys Thr Arg Leu 
430 435 440 

get tta eat cac gtt ggc aaa tta eta ttt tec cag age act gaa tta 
Ala Leu His His Val Gly Lys Leu Leu Phe Ser Gin Ser Thr Glu Leu 
445 450 . 455 

gtc aat ect geg atg aae cgc ggt ctg ccg ect tea gta get gee aca 
Val Asn Pro Ala Met Asn Arg Gly Leu Pro Pro Ser Val Ala Ala Thr 
460 465 470 475 

gat cca tet etc aae tac cac gee aaa gga eta gac ata gea act geg 
Asp Pro Ser Leu Asn Tyr His Ala Lys Gly Leu Asp lie Ala Thr Ala 
480 485 490 

gcc tac gta gee gaa geg act ect ggc ccc act cac att cag teg gea 
Ala Tyr Val Ala Glu Ala Thr Pro Gly Pro Thr His He Gin Ser Ala 
495 500 505 

gaa atg eae aac caa get gtt aac tec ctg geg ttg att tct get egg 
Glu Met His Asn Gin Ala Val Asn Ser Leu Ala Leu He Ser Ala Arg 
510 515 520 

get acc ate aca teg ttg gaa gtg eta aca tet ctg ate geg tct tac 
Ala Thr He Thr Ser Leu Glu Val Leu Thr Ser Leu He Ala Ser Tyr 
525 530 535 
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ttg tat att eta tgc caa get etc gac etc cgt gee ett eag cge gag 
Leu Tyr He Leu Cys Gin Ala Leu Asp Leu Arg Ala Leu Gin Arg Glu 
540 545 550 555 

ttc ttg ccc ggt eta gac ate ate att cgt gag gag tta aga teg tea 
Phe Leu Pro Gly Leu Asp He He He Arg Glu Glu Leu Arg Ser Ser 
560 565 570 

ttt gga tct ttc ctg tea tea gaa eag atg gag aaa ttg caa caa aat 
Phe Gly Ser Phe Leu Ser Ser Glu Gin Met Glu Lys Leu Gin Gin Asn 
575 580 585 

eta act agt gca ttt gaa gat cat ett gac aag ace acg aca atg gat 
Leu Thr Ser Ala Phe Glu Asp His Leu Asp Lys Thr Thr Thr Met Asp 
590 595 600 

aat act gat cga atg act aeg atg get get aca tea tea tea gtt eta 
Asn Thr Asp Arg Met Thr Thr Met Ala Ala Thr Ser Ser Ser Val Leu 
605 610 615 

ett caa ttc ttt act gat tct ggc gcg tct gtt cot cce teg tct tgc 
Leu Gin Phe Phe Thr Asp Ser Gly Ala Ser Val Pro Pro Ser Ser Cys 
620 625 630 635 

gat ett etc tec agt gtc teg tec tte caa tct tct gtg gcg aca egg 
Asp Leu Leu Ser Ser Val Ser Ser Phe Gin Ser Ser Val Ala Thr Arg 
640 645 650 

tct tea gtt etc atg gat gac eta egg aaa gaa tat att ttt gga gac 
Ser Ser Val Leu Met Asp Asp Leu Arg Lys Glu Tyr He Phe Gly Asp 



cgt ggc ccc acg ccc gca age caa tac ate gga aag aca egg eca gta 
Arg Gly Pro Thr Pro Ala Ser Gin Tyr He Gly Lys Thr Arg Pro Val 
670 675 680 

tac caa ttc att aga aea act ata ggc gtt cgt aag cat ggt tct gag 
Tyr Gin Phe He Arg Thr Thr He Gly Val Arg Lys His Gly Ser Glu 
685 690 695 

aac tac aac aag ttt tat aat ggg ctg ggt gtc gaa gac gtt ace ate 
Asn Tyr Asn Lys Phe Tyr Asn Gly Leu Gly Val Glu Asp Val Thr He 
700 705 710 715 

ggt caa aat ata tea cgc ata tac gag tea ate egg gac ggc aaa atg 
Gly Gin Asn He Ser Arg He Tyr Glu Ser He Arg Asp Gly Lys Met 
720 725 730 
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caa tec att att gtc teg ttg ttt gat taggtcttga aagcttgtat 
Gin Ser lie lie Val Ser Leu Phe Asp 
735 740 

cttattaata accatacact tcctcgaggt ctaaaaaaaa aaaaaaaaaa aaaa 



<210> 15 
<211> 740 
<212> PRT 

<213> Amanita muscaria 
<400> 15 

Met Gly Leu Asp Asn Ser Lys Asn Thr Ala Lys Phe Phe Asp Leu Pro 



Lys Ala Val His Gly Met Asn Gly Thr Thr Pro Val Asn Gly Phe Lys 
20 25 30 

Ala Thr Ala Leu Ser Lys Ala Ser Arg Thr Met Thr Lys Thr Ser Ala 
35 40 45 

Leu Ser Gin Phe Leu Glu Ala Tyr Arg Glu Leu Glu Gly Tyr Lys Asn 
50 55 60 

Gly Arg Ala lie Lys Val Asp Gly Gin Thr Leu Ser lie Ala Ala Val 
65 70 75 80 

Ala Ala Ala Ala Arg Tyr Asn Ala Ala Val Glu Leu Asp Glu Ser Pro 



Leu Val Lys Glu Arg Val Arg Lys Ser Gin Leu Ala lie Ala Asn Lys 
100 105 110 

Val Ser Thr Gly Ala Ser Val Tyr Gly Leu Ser Thr Gly Phe Gly Gly 
115 120 125 

Ser Ala Asp Thr Arg Thr Asp Lys Pro Met Leu Leu Gly Phe Ala Leu 
130 135 140 

Leu Gin His Gin His Val Gly He Leu Pro Thr Ser Thr Glu Pro Leu 
145 150 155 160 

Asp Val Leu Pro Leu Gin Asp Ala Asn Asn Thr Ser Met Pro Glu Ala 
165 170 175 

Trp He Arg Gly Ala He Leu He Arg Met Asn Ser Leu He Arg Gly 
180 185 190 
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His Ser Gly lie Arg Trp Glu Leu lie Glu Lys Met Arg Glu Leu Leu 
195 200 205 



Ala Ala Asn Val lie Pro Val Val Pro Leu Arg Gly Ser lie Ser Ser 
210 215 220 



Ser Gly Asp Leu Ser Pro Leu Ser Tyr lie Ala Gly Thr lie lie Gly 
225 230 235 240 



Asn Pro Ser lie Lys Val Tyr His Gly Pro Ser Lys Ser Gly lie Arg 
245 250 255 



Gin lie Gly Ser Ser Lys Asp Val Leu Ala Leu His Asn He Glu Pn 
260 265 270 



Phe Pro Leu Glu Ser Lys Glu Pro Leu Gly He Leu Asn Gly Thr Ala 
275 280 285 



Phe Ser Ala Ser Val Ala Ala Leu Ala Leu Asn Glu Ala He His Leu 
290 295 300 



Val Leu Leu Ala Gin Val Cys Thr Ala Met Gly Thr Glu Ala Leu He 
305 310 315 320 



Gly Thr Arg Ala Ser His Ala Pro Phe He His Ala Thr Ala Arg Pro 
325 330 335 



His Pro Gly Gin Val Glu Cys Ala Glu Asn He Trp Asn Leu Leu Asp 
340 345 ■ 350 



Gly Ser Lys Leu Ala Gin Leu Glu Glu His Glu Val Arg Leu Glu Asp 
355 360 365 



Asp Lys Tyr Thr Leu Arg Gin Asp Arg Tyr Pro Leu Arg Thr Ser Pro 
370 375 380 



Gin Phe Leu Gly Pro Gin He Glu Asp He He Ser Ala Phe Gin Thr 
385 390 395 400 



Val Thr Gin Glu Cys Asn Tyr Leu Pro Ala Thr Asp Asn Pro Leu He 
405 410 415 



Asp Gly Glu Thr Gly Glu Ser His His Gly Gly Asn Phe Gin Ala Met 
420 425 430 



Ala Val Thr Asn Ala Met Glu Lys Thr Arg Leu Ala Leu His His Val 
435 440 445 
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Gly Lys Leu Leu Phe Ser Gin Ser Thr Glu Leu Val Asn Pro Ala Met 
450 455 460 



Asn Arg Gly Leu Pro Pro Ser Val Ala Ala Thr Asp Pro Ser Leu Asn 
465 470 475 480 

Tyr His Ala Lys Gly Leu Asp He Ala Thr Ala Ala Tyr Val Ala Glu 
485 490 495 

Ala Thr Pro Gly Pro Thr His He Gin Ser Ala Glu Met His Asn Gin 
500 505 510 

Ala Val Asn Ser Leu Ala Leu He Ser Ala Arg Ala Thr He Thr Ser 
515 520 525 

Leu Glu Val Leu Thr Ser Leu He Ala Ser Tyr Leu Tyr He Leu Cys 
530 535 540 

Gin Ala Leu Asp Leu Arg Ala Leu Gin Arg Glu Phe Leu Pro Gly Leu 
545 550 555 560 

Asp He He He Arg Glu Glu Leu Arg Ser Ser Phe Gly Ser Phe Leu 
565 570 575 

Ser Ser Glu Gin Met Glu Lys Leu Gin Gin Asn Leu Thr Ser Ala Phe 
580 585 590 

Glu Asp His Leu Asp Lys Thr Thr Thr Met Asp Asn Thr Asp Arg Met 
595 600 605 

Thr Thr Met Ala Ala Thr Ser Ser Ser Val Leu Leu Gin Phe Phe Thr 
610 615 620 

Asp Ser Gly Ala Ser Val Pro Pro Ser Ser Cys Asp Leu Leu Ser Ser 
625 630 635 640 

Val Ser Ser Phe Gin Ser Ser Val Ala Thr Arg Ser Ser Val Leu Met 
645 650 655 

Asp Asp Leu Arg Lys Glu Tyr He Phe Gly Asp Arg Gly Pro Thr Pro 
660 655 670 

Ala Ser Gin Tyr He Gly Lys Thr Arg Pro Val Tyr Gin Phe He Arg 
675 680 685 

Thr Thr He Gly Val Arg Lys His Gly Ser Glu Asn Tyr Asn Lys Phe 
690 695 700 



Tyr Asn Gly Leu Gly Val Glu Asp Val Thr He Gly Gin Asn He Ser 
705 710 715 720 

Arg He Tyr Glu Ser He Arg Asp Gly Lys Met Gin Ser He He Val 
725 730 735 

Ser Leu Phe Asp 
740 



<210> 16 
<211> 2787 
<212> DNA 

<213> Rhodotorula mucilaginosa 

<220> 
<221> CDS 

<222> (646) . . (2784) 
<400> 16 

ctgcagaatc gccgcgacgt gacctagacc tctcttttcg ccctatcgcc ctctcgcaca 60 

tatgtcctgt gccgcctttc tgtccctcct ggactgtatc atacrctgtg gccccttgcc 120 

tcggcttgtt cctctcctct atagacatgg gtactgatca cttcgtttgt tgttctcgtt 180 

ttctcttgca agtacgaccc atcctttttt ctcgcgatcg acttcaatat cggtgcgctt 240 

gcactacgtc tcttcgccag tgcacagtat gacgggaggg gacacgactg gcggcgaaag 300 

cggagtcgtc gttgccggtc ccgagagacg ggaacacttt cttccgcctt ccagaggccg 360 

tactccggtg atttgacatc gcactatgct tggtggggcg cccgaactcg gagcttgcga 420 

tgtgcctgaa gcagagctcg gcaggcgaca tggcgactcc gccacattcg ggtcaaagcc 480 

gaagtggggc ctcggacgtt ccgaacgtcg tcttgccgcc gccgcgttgc gtccgccgtc 540 

gactttgtcc tcgtgtgctc acgcctcctt ctttctctct tctgctttcc tcacttcact 600 

ctgcaagtcc cgcctcgcat ccacccaacc cgagcagctc tcaag atg gcc ccc tec 657 

Met Ala Pro Ser 



gtc gac teg ate gcg aet teg gtt gee aac tec etc teg aac ggg ttg 705 
Val Asp Ser He Ala Thr Ser Val Ala Asn Ser Leu Ser Asn Gly Leu 
5 10 15 20 



-19- 



cac gcc gcc gcc gcc gcc aac ggt ggc gac gtc cac aag aag acg gcc 
His Ala Ala Ala Ala Ala Asn Gly Gly Asp Val His Lys Lys Thr Ala 
25 30 35 

ggt get ggc tec etc etc ccg acc acc gag acg acc cag etc gac ate 
Gly Ala Gly Ser Leu Leu Pro Thr Thr Glu Thr Thr Gin Leu Asp lie 



gtt gag cgc ate ttg gcc gac gcc ggc gcg acg gac cag ate aaa etc 
Val Glu Arg He Leu Ala Asp Ala Gly Ala Thr Asp Gin He Lys Leu 



gat ggg tac acc etc acg etc ggc gac gtc gtc ggc get get cgc cgt 
Asp Gly Tyr Thr Leu Thr Leu Gly Asp Val Val Gly Ala Ala Arg Arg 



ggc cgc tec gtc aag gtc gca gac age ccg cac ate cgc gag aag ate 
Gly Arg Ser Val Lys Val Ala Asp Ser Pro His He Arg Glu Lys He 
85 90 95 100 

gat gcc agt gtc gag ttc etc cgt act cag etc gac aac agt gtc tac 
Asp Ala Ser Val Glu Phe Leu Arg Thr Gin Leu Asp Asn Ser Val Tyr 
105 110 115 

ggt gtc acg act ggt ttc ggc ggc teg gcc gac acc egg act gag gat 
Gly Val Thr Thr Gly Phe Gly Gly Ser Ala Asp Thr Arg Thr Glu Asp 
120 125 130 

gcg ate teg etc caa aag gcc ctg etc gag cac cag etc tgc ggt gtc 
Ala He Ser Leu Gin Lys Ala Leu Leu Glu His Gin Leu Cys Gly Val 
135 140 145 

etc CCG acc teg atg gat ggc ttt gcg etc ggt cgc ggc etc gag aac 
Leu Pro Thr Ser Met Asp Gly Phe Ala Leu Gly Arg Gly Leu Glu Asn 
150 155 160 

teg ctt ccg etc gaa gtc gtc cga ggc gcg atg acc ate cgt gtc aac 
Ser Leu Pro Leu Glu Val Val Arg Gly Ala Met Thr He Arg Val Asn 
165 170 175 180 

teg etc act cgc ggt cac teg gcg gtc cgc ate gtc gtc etc gaa gee 
Ser Leu Thr Arg Gly His Ser Ala Val Arg He Val Val Leu Glu Ala 
185 190 195 

etc acc aac ttc etc aac cac ggc ate acc ccg ate gtc ccg ctt cga 
Leu Thr Asn Phe Leu Asn His Gly He Thr Pro He Val Pro Leu Arg 
200 205 210 
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ggc acc ate teg gcg teg gge gac ctt tec ccc etc tct tac ate gee 
Gly Thr lie Ser Ala Ser Gly Asp Leu Ser Pro Leu Ser Tyr lie Ala 
215 220 225 

gee teg ate aec ggc cae ecg gae teg aag gte cac gtc gac ggc aag 
Ala Ser lie Thr Gly His Pro Asp Ser Lys Val His Val Asp Gly Lys 
230 235 240 

ate atg tec gcc cag gag gcg ate gcg etc aag ggt ctt cag ccc gtc 
lie Met Ser Ala Gin Glu Ala lie Ala Leu Lys Gly Leu Gin Pro Val 
245 250 255 260 

gtc etc ggt ecg aag gag ggt ete ggt etc gte aac gge aeg gee gtc 
Val Leu Gly Pro Lys Glu Gly Leu Gly Leu Val Asn Gly Thr Ala Val 
265 270 275 

tec gee teg atg gcg acg ctg gcc etc ace gae gea cac gte etc teg 
Ser Ala Ser Met Ala Thr Leu Ala Leu Thr Asp Ala His Val Leu Ser 
280 285 290 

etc etc gca cag gcg etc act get ctt act gtc gag gee atg gtc gga 
Leu Leu Ala Gin Ala Leu Thr Ala Leu Thr Val Glu Ala Met Val Gly 
295 300 305 

cac gcc ggc teg ttc cac cca ttc etc cac gac gtc acg cgc ect cac 
His Ala Gly Ser Phe His Pro Phe Leu His Asp Val Thr Arg Pro His 
310 315 320 

cog acc cag ate gag gtg gcg cgc aac ate egg act ctt etc gag ggc 
Pro Thr Gin lie Glu Val Ala Arg Asn lie Arg Thr Leu Leu Glu Gly 
325 330 335 340 

age aag tac gcc gtc cac cac gag act gaa gte aag gte aag gac gae 
Ser Lys Tyr Ala Val His His Glu Thr Glu Val Lys Val Lys Asp Asp 
345 350 355 

gag ggc ate etc agg cag gac egg tac ecg etc cgc tgc teg ecg cag 
Glu Gly lie Leu Arg Gin Asp Arg Tyr Pro Leu Arg Cys Ser Fro Gin 
360 365 370 

tgg etc ggt ccc ctt gtc age gae atg att cac get cac get gtc etc 
Trp Leu Gly Pro Leu Val Ser Asp Met He His Ala His Ala Val Leu 
375 380 385 

teg etc gag get ggt cag teg acc acc gae aac ecg ctg ate gac etc 
Ser Leu Glu Ala Gly Gin Ser Thr Thr Asp Asn Pro Leu He Asp Leu 
390 ■ 395 400 
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gag aac aag atg acc cac cat ggc gga gcc ttc atg gcg age age gtc 
Glu Asn Lys Met Thr His His Gly GLy Ala Phe Met Ala Ser Ser Val 
405 410 415 420 

gga aac acg atg gag aag act cgc etc gcc gtc gcg ctg atg ggc aag 
Gly Asn Thr Met Glu Lys Thr Arg Leu Ala \^al Ala Leu Met Gly Lys 
425 430 435 

gtc age ttt act cag etc acc gag atg etc aac gcc ggc atg aac egg 
Val Ser Phe Thr Gin Leu Thr Glu Met Leu Asn Ala Gly Met Asn Arg 
440 445 450 

gcc ctt ccg tec tgc etc get gcc gag gac cct tec etc tct tat cac 
Ala Leu Pro Ser Cys Leu Ala Ala Glu Asp Pro Ser Leu Ser Tyr His 
455 460 465 

tgc aag ggt etc gac att get gcg gcc gcc tac act tec gag etc ggt 
Cys Lys Gly Leu Asp He Ala Ala Ala Ala Tyr Thr Ser Glu Lea Gly 
470 475 480 

cac ctt gcc aac ccg gtt teg acc cac gtc cag ccg gcc gag atg ggc 
His Leu Ala Asn Pro Val Ser Thr His Val Gin Pro Ala Glu Met Gly 
485 490 495 500 

aac cag gcc ate aac teg etc gcc etc ate teg gcc cgc cgc acc gcc 
Asn Gin Ala He Asn Ser Leu Ala Leu He Ser Ala Arg Arg Thr Ala 
505 510 515 

gag gcg aac gac gtt etc tec etc etc etc gcc acc cac etc tac tgc 
Glu Ala Asn Asp Val Leu Ser Leu Leu Leu Ala Thr His Leu Tyr Cys 
520 525 530 

gtc etc cag gee gtc gac etc cgc gcg atg gag ttt gag cac acc aag 
Val Leu Gin Ala Val Asp Leu Arg Ala Met Glu Phe Glu His Thr Lys 
535 540 545 

gcg ttc gag ccg atg gtc act gag ctg ttg aag cag cac ttt ggc gcg 
■ Ala Phe Glu Pro Met Val Thr Glu Leu Leu Lys Gin His Phe Gly Ala 
550 555 560 

etc gcg acg gcc gaa gtc gag gac aag gtc cgc aag teg ate tac aag 
Leu Ala Thr Ala Glu Val Glu Asp Lys Val Arg Lys Ser He Tyr Lys 
565 570 575 580 

egg ttg cag cag aac aac teg tac gac etc gag cag egg tgg cac gac 
Arg Leu Gin Gin Asn Asn Ser Tyr Asp Leu Glu Gin Arg Trp His Asp 
585 590 595 
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acg ttc teg gtc gcg acc ggt gcc gtc gtc gag gcg etc gcc ggc cag 
Thr Phe Ser Val Ala Thr Gly Ala Val Val Glu Ala Leu Ala Gly Gin 
600 605 610 

gag gtc teg etc gcg age etc aac gcc tgg aag gtc gcc tgc gcc gag 
Glu Val Ser Leu Ala Ser Leu Asn Ala Trp Lys Val Ala Cys Ala Glu 
615 620 625 

aag get ate gcg etc aeg cgc tec gtc cge gac tog tte tgg gcg get 
Lys Ala He Ala Leu Thr Arg Ser Val Arg Asp Ser Phe Trp Ala Ala 
630 635 640 

eeg teg teg teg teg cee gcg etc aag tac etc tec ecg egg aeg ege 
Pro Ser Ser Ser Ser Pro Ala Leu Lys Tyr Leu Ser Pro Arg Thr Arg 
645 650 655 660 

gtc ctg tat teg ttc gtc egg gag gag gtc ggc gtc aag gcc cge cgc 
■Val Leu Tyr Ser Phe Val Arg Glu Glu Val Gly Val Lys Ala Arg Arg 
665 570 675 

ggc gat gtc tac etc ggc aag cag gag gtc acg ate ggc acc aac gtc 
Gly Asp Val Tyr Leu Gly Lys Gin Glu Val Thr He Gly Thr Asn Val 
680 685 690 

age cgc ate tac gag gcg ate aag age ggt tgc ate gee ecc gtc etc 
Ser Arg He Tyr Glu Ala He Lys Ser Gly Cys He Ala Pio Val Leu 
695 700 705 

gtc aag atg atg gea tag 
Val Lys Met Met Ala 
710 



<210> 17 
<211> 713 
<212> PRT 

<213> Rhodotorula mucilaginosa 
<400> 17 

Met Ala Pro Ser Val Asp Ser He Ala Thr Ser Val Ala Asn Ser Leu 
15 10 15 

Ser Asn Gly Leu His Ala Ala Ala Ala Ala Asn Gly Gly Asp Val His 
20 25 30 

Lys Lys Thr Ala Gly Ala Gly Ser Leu Leu Pro Thr Thr Glu Thr Thr 
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Gin Leu Asp He Val Glu Arg He Leu Ala Asp Ala Gly Ala Thr Asp 
50 55 60 

Gin He Lys Leu Asp Gly Tyr Thr Leu Thr Leu Gly Asp Val Val Gly 
65 70 75 80 



Ala Ala Arg Arg Gly Arg Ser Val Lys Val Ala 



Ser Pro His He 



Arg Glu Lys He Asp Ala Ser Val Glu Phe Leu Arg Thr Gin Leu Asp 
100 105 110 

Asn Ser Val Tyr Gly Val Thr Thr Gly Phe Gly Gly Ser Ala Asp Thr 
115 120 125 

Arg Thr Glu Asp Ala He Ser Leu Gin Lys Ala Leu Leu Glu His Gin 
130 135 140 

Leu Cys Gly Val Leu Pro Thr Ser Met Asp Gly Phe Ala Leu Gly Arg 
145 150 155 160 

Gly Leu Glu Asn Ser Leu Pro Leu Glu Val Val Arg Gly Ala Met Thr 
165 170 175 

He Arg Val Asn Ser Leu Thr Arg Gly His Ser Ala Val Arg He Val 
180 185 190 

Val Leu Glu Ala Leu Thr Asn Phe Leu Asn His Gly He Thr Pro He 
195 200 205 

Val Pro Leu Arg Gly Thr He Ser Ala Ser Gly Asp Leu Ser Pro Leu 
210 215 220 

Ser Tyr He Ala Ala Ser He Thr Gly His Pro Asp Ser Lys Val His 
225 230 235 240 

Val Asp Gly Lys He Met Ser Ala Gin Glu Ala He Ala Leu Lys Gly 
245 250 255 

Leu Gin Pro Val Val Leu Gly Pro Lys Glu Gly Leu Gly Leu Val Asn 
260 265 270 

Gly Thr Ala Val Ser Ala Ser Met Ala Thr Leu Ala Leu Thr Asp Ala 
275 280 285 

His Val Leu Ser Leu Leu Ala Gin Ala Leu Thr Ala Leu Thr Val Glu 
290 295 300 
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Ala Met Val Gly His Ala Gly Ser Phe His Pro Phe Leu His Asp Val 
305 310 315 320 

Thr Arg Pro His Pro Thr Gin He Glu Val Ala Arg Asn He Arg Thr 
325 330 335 

Leu Leu Glu Gly Ser Lys Tyr Ala ^7al His His Glu Thr Glu Val Lys 
340 345 350 

Val Lys Asp Asp Glu Gly He Leu Arg Gin Asp Arg Tyr Pro Leu Arg 
355 360 365 

Cys Ser Pro Gin Trp Leu Gly Pro Leu Val Ser Asp Met He His Ala 
370 375 380 

His Ala Val Leu Ser Leu Glu Ala Gly Gin Ser Thr Thr Asp Asn Pro 
385 390 395 400 

Leu He Asp Leu Glu Asn Lys Met Thr His His Gly Gly Ala Phe Met 
405 410 415 

Ala Ser Ser Val Gly Asn Thr Met Glu Lys Thr Arg Leu Ala Val Ala 
420 425 430 

Leu Met Gly Lys Val Ser Phe Thr Gin Leu Thr Glu Met Leu Asn Ala 
435 440 445 

Gly Met Asn Arg Ala Leu Pro Ser Cys Leu Ala Ala Glu Asp Pro Ser 
450 455 460 

- Leu Ser Tyr His Cys Lys Gly Leu Asp He Ala Ala Ala Ala Tyr Thr 
465 470 475 480 

Ser Glu Leu Gly His Leu Ala Asn Pro Val Ser Thr His Val Gin Pro 
485 490 495 

. Ala Glu Met Gly Asn Gin Ala He Asn Ser Leu Ala Leu He Ser Ala 
500 505 510 

Arg Arg Thr Ala Glu Ala Asn Asp Val Leu Ser Leu Leu Leu Ala Thr 
515 520 525 

His Leu Tyr Cys Val Leu Gin Ala Val Asp Leu Arg Ala Met Glu Phe 
530 535 540 

Glu His Thr Lys Ala Phe Glu Pro Met Val Thr Glu Leu Leu Lys Gin 
545 550 555 560 
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His Phe Gly Ala 



Ser lie Tyr Lys 
580 

Arg Trp His Asp 
595 

Leu Ala Gly Gin 
610 

Ala Cys Ala Glu 
625 

Phe Trp Ala Ala 



Pro Arg Thr Arg 
660 

Lys Ala Arg Arg 
675 

Gly Thr Asn Val 
690 

■Ala Pro Val Leu 
705 



Leu Ala Thr Ala 
565 

Arg Leu Gin Gin 



Thr Phe Ser Val 
600 

Glu Val Ser Leu 
615 

Lys Ala lie Ala 
630 

Pro Ser Ser Ser 
645 

Val Leu Tyr Ser 



Gly Asp Val Tyr 
680 

Ser Arg lie Tyr 

695 

Val Lys Met Met 
710 



Glu Val Glu Asp 
570 

Asn Asn Ser Tyr 
585 

Ala Thr Gly Ala 



Ala Ser Leu Asn 
620 

Leu Thr Arg Ser 
635 

Ser Pro Ala Leu 
650 

Phe Val Arg Glu 
665 

Leu Gly Lys Gin 



Glu Ala He Lys 
700 

Ala 



Lys Val Arg Lys 
575 

Asp Leu Glu Gin 
590 

Val Val Glu Ala 
605 

Ala Trp Lys Val 



Val Arg Asp Ser 
640 

Lys Tyr Leu Ser 
655 

Glu Val Gly Val 
670 

Glu Val Thr He 
685 

Ser Gly Cys He 



<210> 18 
<211> 2439 
<212> DNA 

<213> Rhodotorula toruloides 

<220> 

<221> CDS 

<222> (1) . . (2148} 

<400> 18 

atg gca ccc teg etc gac teg ate 
Met Ala Pro Ser Leu Asp Ser He 
1 5 

gca tec gca aag cag get gtc aat 
Ala Ser Ala Lys Gin Ala Val Asn 
20 



teg cae teg ttc gca aac ggc gtc 48 
Ser His Ser Phe Ala Asn Gly Val 
10 15 

ggc gee teg ace aac etc gca gtc 96 
Gly Ala Ser Thr Asn Leu Ala Val 
25 30 
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gca ggc teg cac ctg ccc aca acc cag gtc acg cag gtc gac ate gtc 
Ala Gly Ser His Leu Pro Thr Thr Gin Val Thr Gin Val Asp He Val 



gag aag atg etc gcc gcg ccg acc gac teg acg etc gaa etc gae gge 
Glu Lys Met Lea Ala Ala Pro Thr Asp Ser Thr Leu Glu Leu Asp Gly 
50 55 60 

tac teg etc aac etc gga gac gtc gtc teg gcc gcg agg aag gge agg 
Tyr Ser Leu Asn Leu Gly Asp Val Val Ser Ala Ala Arg Lys Gly Arg 



cct gtc cgc gtc aag gac age gae gag ate cge tea aag att gac aaa 
Pro Val Arg Val Lys Asp Ser Asp Glu lie Arg Ser Lys He Asp Lys 



teg gtc gag ttc ttg cgc teg caa etc tee atg age gtc tac gge gte 
Ser Val Glu Phe Leu Arg Ser Gin Leu Ser Met Ser Val Tyr Gly Val 
100 105 110 

aeg act gga ttt gge gga tec gca gae ace cge acc gag gae gee ate 
Thr Thr Gly Phe Gly Gly Ser Ala Asp Thr Arg Thr Glu Asp Ala He 
115 120 125 

teg etc cag aag get etc etc gag eae cag etc tgc ggt gtt etc cct 
Ser Leu Gin Lys Ala Leu Leu Glu His Gin Leu Cys Gly Val Leu Pro 
130 135 140 

teg teg ttc gae teg ttc cgc etc ggc cgc ggt etc gag aac teg ett 
Ser Ser Phe Asp Ser Phe Arg Leu Gly Arg Gly Leu Glu Asn Ser Leu 
145 150 155 160 

ccc etc gag gtt gtt cgc ggc gcc atg aca ate cgc gtc aac age ttg 
Pro Leu Glu Val Val Arg Gly Ala Met Thr He Arg Val Asn Ser Leu 
155 170 175 

acc cgc gge eac teg get gtc cgc etc gtc gtc etc gag gcg etc ace 
Thr Arg Gly His Ser Ala Val Arg Leu Val Val Leu Glu Ala Leu Thr 
180 185 190 

aac ttc etc aac cac ggc ate ace eee ate gtc ccc etc cgc ggc acc 
Asn Phe Leu Asn His Gly He Thr Pro He Val Pro Leu Arg Gly Thr 
195 200 205 

ate tet gcg teg gge gae etc tet cct etc tec tac att gea gcg gee 
He Ser Ala Ser Gly Asp Leu Ser Pro Leu Ser Tyr He Ala Ala Ala 
210 215 220 
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ate age ggt cac ccg gac age aag gtg cac gtc gtc cac gag ggc aag 
lie Ser Gly His Pro Asp Ser Lys Val His Val Val His Glu Gly Lys 
225 230 235 240 

gag aag ate etg tae gee cgc gag gcg atg geg etc ttc aac cte gag 
Glu Lys lie Leu Tyr Ala Arg Glu Ala Met Ala Leu Phe Asn Leu Glu 
245 250 255 

CGC gtc gtc etc ggc ccg aag gaa ggt etc ggt etc gtc aac ggc ace 
Pro Val Val Leu Gly Pro Lys Glu Gly Leu Gly Leu Val Asn Gly Thr 
260 265 270 

gee gtc tea gca teg atg gee aec ete get etg eae gae gea eae atg 
.Ala Val Ser Ala Ser Met Ala Thr Leu Ala Leu His Asp Ala His Met 
275 280 285 

etc teg etc etc teg cag teg etc acg gee atg acg gtc gaa gcg atg 
Leu Ser Leu Leu Ser Gin Ser Leu Thr Ala Met Thr Val Glu Ala Met 
290 295 300 

gte gge eae gee ggc teg tte eae cce tte ett cac gac gtc acg cgc 
- Val Gly His Ala Gly Ser Phe His Pro Phe Leu His Asp Val Thr Arg 
305 310 315 320 

eet cac ccg acg cag ate gaa gtc gcg gga aac ate cge aag ete ete 
Pro His Pro Thr Gin lie Glu Val Ala Gly Asn lie Arg Lys Leu Leu 
325 330 335 

gag gga age cgc ttt get gtc cac cat gag gag gag gtc aag gtc aag 
Glu Gly Ser Arg Phe Ala Val His His Glu Glu Glu Val Lys Val Lys 
340 345 350 

gac gac gag gge att etc cgc cag gae cgc cac cce ttg cgc acg tet 
Asp Asp Glu Gly lie Leu Arg Gin Asp Arg Tyr Pro Leu Arg Thr Ser 
355 360 365 

cet cag tgg etc gge ccg etc gtc age gac etc att cac gee cac gee 
Pro Gin Trp Leu Gly Pro Leu Val Ser Asp Leu lie His Ala His Ala 
370 375 380 

gtc ete ace ate gag gee ggc cag teg aeg ace gac aac eet etc ate 
. Val Leu Thr lie Glu Ala Gly Gin Ser Thr Thr Asp Asn Pro Leu lie 
385 390 395 400 

.gac gtc gag aac aag act teg cac eae gge ggc aat ttc cag get gee 
Asp Val Glu Asn Lys Thr Ser His His Gly Gly Asn Phe Gin Ala Ala 
405 410 415 
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get gtg gcc aac acc atg gag aag act cgc etc ggg etc gee cag ate 
Ala Val Ala Asn Thr Met Glu Lys Thr Arg Leu Gly Leu Ala Gin lie 
420 425 430 

ggc aag etc aac ttc acg cag etc acc gag atg etc aac gcc gge atg 
Gly Lys Leu Asn Phe Thr Gin Leu Thr Glu Met Leu Asn Ala Gly Met 
435 440 445 

aac cgc ggc etc ecc tee tgc etc geg gcc gaa gac cce teg etc tec 
Asn Arg Gly Leu Pro Ser Cys Leu Ala Ala Glu Asp Pro Ser Leu Ser 
450 455 460 

tae cac tgc aag ggc etc gac ate gee get gcg geg tac acc teg gag 
Tyr His Cys Lys Gly Leu Asp lie Ala Ala Ala Ala Tyr Thr Ser Glu 
465 470 475 480 

ttg gga cac etc gcc aac act gtg acg acg cat gtc cag ccg get gag 
Leu Gly His Leu Ala Asn Pro Val Thr Thr His Val Gin Pro Ala Glu 
485 490 495 

atg geg aac eag gcg gtc aac teg ett gcg etc ate teg get egt cgc 
Met Ala Asn Gin Ala Val Asn Ser Leu Ala Leu lie Ser Ala Arg Arg 
500 505 510 

acg acc gag tec aac gac gtc ctt tct etc etc etc gcc acc cac etc 
Thr Thr Glu Ser Asn Asp Val Leu Ser Leu Leu Leu Ala Thr His Leu 
515 520 525 

tae tgc gtt etc caa gcc ate gac ttg cgc gcg ate gag ttc gag ttc 
Tyr Cys Val Leu Gin Ala lie Asp Leu Arg Ala lie Glu Phe Glu Phe 
530 535 540 

aag aag cag ttc ggc cca gcc ate gtc teg etc ate gac cag cac ttt 
Lys Lys Gin Phe Gly Pro Ala lie Val Ser Leu lie Asp Gin His Phe 
545 550 555 560 

ggc tee gee atg acc ggc teg aac ctg cgc gac gag etc gtc gag aag 
Gly Ser Ala Met Thr Gly Ser Asn Leu Arg Asp Glu Leu Val Glu Lys 
565 570 575 

gtg aac aag acg etc gcc aag cgc etc gag cag ace aac teg tae gac 
Val Asn Lys Thr Leu Ala Lys Arg Leu Glu Gin Thr Asn Ser Tyr Asp 
580 585 590 

etc gtc ccg cgc tgg cac gac gcc ttc tec ttc gcc gcc gge acc gtc 
Leu Val Pro Arg Trp His Asp Ala Phe Ser Phe Ala Ala Gly Thr Val 
595 600 605 
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gtc gag gtc etc teg teg aeg teg etc teg etc gee gee gte aac gee 1872 
Val Glu Val Leu Ser Ser Thr Ser Leu Ser Leu Ala Ala Val Asn Ala 
610 615 620 

tgg aag gtc gee gcc gee gag teg gcc ate teg etc aec cgc caa gtc 1920 
Trp Lys Val Ala Ala Ala Glu Ser Ala lie Ser Leu Thr Arg Gin Val 
625 630 635 640 

cgc gag ace ttc tgg tec gcc gcg teg aec teg teg cce geg etc teg 1968 
Arg Glu Thr Phe Trp Ser Ala Ala Ser Thr Ser Ser Pro Ala Leu Ser 
645 650 655 

tac etc teg eeg cgc act eag ate etc tae gcc ttc gte cgc gag gag 2016 
Tyr Leu Ser Pro Arg Thr Gin lie Leu Tyr Ala Phe Val Arg Glu Glu 
660 665 670 

ett ggc gtc aag gee cgc cgc gga gae gtc ttc etc ggc aag eaa gag 2064 
Leu Gly Val Lys Ala Arg Arg Gly Asp Val Phe Leu Gly Lys Gin Glu 
675 680 685 

gtg acg ate ggc teg aac gte tee aag ate tae gag gcc ate aag teg 2112 
Val Thr lie Gly Ser Asn Val Ser Lys lie Tyr Glu Ala lie Lys Ser 
690 695 700 

gge agg ate aac aac gtc etc etc aag atg etc get tagacactct 2158 

Gly Arg lie Asn Asn Val Leu Leu Lys Met Leu Ala 
705 710 715 

tcccactetc geateccttc cataccetat ceegcctgca etettaggae tcgcttettg 2218 

tcggaetegg atctcgcatc gcttctttcg ttcttggctg cctctctaga ccgtgtccgt 2278 

attaeetega gattgtgaat acaageagta eccatccacg eatecgataa atcagggaga 2338 

gaatctacgc ttgcgggagc ttcttgcgca taaactgtcg agtgcgggcg ttagtgcgaa 2398 

gtcaacgaag gcgagtggea gcggctcact accgcctcga g 2439 



<210> 19 
<211> 716 
<212> PRT 

<213> Rhodotorula toruloides 
<400> 19 

Met Ala Pro Ser Leu Asp Ser lie Ser His Ser Phe Ala Asn Gly Val 
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Ala Ser Ala Lys Gin Ala Val Asn Gly Ala Ser Thr Asn Leu Ala Val 
20 25 30 

Ala Gly Ser His Leu Pro Thr Thr Gin Val Thr Gin Val Asp lie Val 
35 40 45 

Glu Lys Met Leu Ala Ala Pro Thr Asp Ser Thr Leu Glu Leu Asp Gly 
50 55 60 

Tyr Ser Leu Asn Leu Gly Asp Val Val Ser Ala Ala Arg Lys Gly Arg 
65 70 75 80 

Pro Val Arg Val Lys Asp Ser Asp Glu lie Arg Ser Lys lie Asp Lys 



Ser Val Glu Phe Leu Arg Ser Gin Leu Ser Met Ser Val Tyr Gly Val 
100 105 110 

Thr Thr Gly Phe Gly Gly Ser Ala Asp Thr Arg Thr Glu Asp Ala lie 
115 120 125 

Ser Leu Gin Lys Ala Leu Leu Glu His Gin Leu Cys Gly Val Leu Pro 
130 135 140 

Ser Ser Phe Asp Ser Phe Arg Leu Gly Arg Gly Leu Glu Asn Ser Leu 
145 150 155 160 

Pro Leu Glu Val Val Arg Gly Ala Met Thr lie Arg Val Asn Ser Leu 
165 170 175 

Thr Arg Gly His Ser Ala Val Arg Leu Val Val Leu Glu Ala Leu Thr 
180 185 190 

Asn Phe Leu Asn His Gly He Thr Pro He Val Pro Leu Arg Gly Thr 
195 200 205 

He Ser Ala Ser Gly Asp Leu Ser Pro Leu Ser Tyr He Ala Ala Ala 
210 215 220 

He Ser Gly His Pro Asp Ser Lys Val His Val Val His Glu Gly Lys 
225 230 235 240 

Glu Lys He Leu Tyr Ala Arg Glu Ala Met Ala Leu Phe Asn Leu Glu 
245 250 255 

Pro Val Val Leu Gly Pro Lys Glu Gly Leu Gly Leu Val Asn Gly Thr 
260 265 270 
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Ala Val Ser Ala Ser Met Ala Thr Leu Ala Leu His Asp Ala His Met 
275 280 285 

Leu Ser Leu Leu Ser Gin Ser Leu Thr Ala Met Thr Val Glu Ala Met 
290 295 300 

Val Gly His Ala Gly Ser Phe His Pro Phe Leu His Asp Val Thr Arg 
305 310 315 320 

Pro His Pro Thr Gin lie Glu Val Ala Gly Asn lie Arg Lys Leu Leu 
325 330 335 

Glu Gly Ser Arg Phe Ala Val His His Glu Glu Glu Val Lys Val Lys 
340 345 350 

Asp Asp Glu Gly lie Leu Arg Gin Asp Arg Tyr Pro Leu Arg Thr Ser 
355 360 365 

Pro Gin Trp Leu Gly Pro Leu Val Ser Asp Leu lie His Ala His Ala 
370 375 380 

■Val Leu Thr lie Glu Ala Gly Gin Ser Thr Thr Asp Asn Pro Leu He 
385 390 395 400 

Asp Val Glu Asn Lys Thr Ser His His Gly Gly Asn Phe Gin Ala Ala 
405 410 415 

Ala Val Ala Asn Thr Met Glu Lys Thr Arg Leu Gly Leu Ala Gin He 
420 425 430 

Gly Lys Leu Asn Phe Thr Gin Leu Thr Glu Met Leu Asn Ala Gly Met 
435 440 445 

Asn Arg Gly Leu Pro Ser Cys Leu Ala Ala Glu Asp Pro Ser Leu Ser 
450 455 460 

Tyr His Cys Lys Gly Leu Asp lie Ala Ala Ala Ala Tyr Thr Ser Glu 
465 470 475 480 

Leu Gly His Leu Ala Asn Pro Val Thr Thr His Val Gin Pro Ala Glu 
485 490 495 

Met Ala Asn Gin Ala Val Asn Ser Leu Ala Leu He Ser Ala Arg Arg 
500 505 510 

Thr Thr Glu Ser Asn Asp Val Leu Ser Leu Leu Leu Ala Thr His Leu 
515 520 525 
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Tyr Cys Val Leu Gin Ala lie Asp Leu Arg Ala lie Glu Phe Glu Phe 
530 535 540 



Lys Lys Gin Phe Gly Pro Ala He Val Ser Leu He Asp Gin His Phe 
545 550 555 560 

Gly Ser Ala Met Thr Gly Ser Asn Leu Arg Asp Glu Leu Val Glu Lys 
565 570 575 

Val Asn Lys Thr Leu Ala Lys Arg Leu Glu Gin Thr Asn Ser Tyr Asp 
580 585 590 

Leu Val Pro Arg Trp His Asp Ala Phe Ser Phe Ala Ala Gly Thr Val 
595 600 605 

Val Glu Val Leu Ser Ser Thr Ser Leu Ser Leu Ala Ala Val Asn Ala 
610 615 620 

Trp Lys Val Ala Ala Ala Glu Ser Ala He Ser Leu Thr Arg Gin Val 
625 630 635 640 

Arg Glu Thr Phe Trp Ser Ala Ala Ser Thr Ser Ser Pro Ala Leu Ser 
645 650 655 

Tyr Leu Ser Pro Arg Tnr Gin He Leu Tyr Ala Phe Val Arg Glu Glu 
660 665 670 

Leu Gly Val Lys Ala Arg Arg Gly Asp Val Phe Leu Gly Lys Gin Glu 
675 680 685 

Val Thr He Gly Ser Asn Val Ser Lys He Tyr Glu Ala He Lys Ser 
690 695 700 

Gly Arg He Asn Asn Val Leu Leu Lys Met Leu Ala 
705 710 715 



<210> 20 
<211> 2475 
<212> DNA 

<213> Artificial Sequence 
<220> 

<221> misc_dif f erence 

<222> (13, 34, 46, 49, 51, 57, 59, 68, 69, 73, 75 - 77, 79, 82, 84, 

102, 106, 108, 115, 119, 144, 145, 164, 168, 190, 191, 195, 199 
201, 233, 251, 264, 266, 276, 284, 312, 315, 330, 331, 333, 340 
348, 357, 405, 423, 450, 456, 463, 468, 475, 483, 555, 570, 675 
681, 715, 716, 723, 729, 730, 732, 743, 744, 758, 783, 921, 963 
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1042, 1043, 1176, 1197, 1241, 1270, 1281, 1308, 1380, 1383, 1407, 

1446, 144^^, 1452, 1470, 1488, 1509, 1542, 1554, 1563, 1617, 1675, 

1677, 1678, 1681, 1683, 1684, 1690, 1693, 1708, 1710, 1723 - 

1735, 1745, 1762, 1768, 1776, 1855 - 1862, 1872 - 1875, 1880, 

1881, 1895, 1950, 1952, 1962, 1971, 1976, 2001, 2145, 2146, 2151, 

2183, 2187 - 2189, 2191, 2193 - 2195, 2197, 2199, 2206, 2208 - 

2212, 2215 - 2217, 2219, 2221 - 2223, 2226 - 2233, 2236, 2239, 
2241 -2243, 2247, 2248, 2251, 2254, 2256 - 2260, 2265, 2266, 

2268, 2269, 2271, 2272, 2274 - 2280, 2282 - 2285, 2287, 2289, 

2290, 2293, 2294, 2298, 2300 - 2303, 2305, 2307, 2308, 2312, 

2313, 2315 - 2319, 2322 - 2324, 2326, 2327, 2329 - 2335, 2337 - 

2339, 2341, 2344 - 2346, 2349 - 2351, 2354, 2356, 2358 - 2363, 

2365, 2366, 2368, 2371, 2373, 2374, 2377, 2379, 2380, 2382, 2384, 

2385, 2387, 2390, 2392, 2393, 2395 - 2403, 2405, 2409 - 2475) 

<223> n = A or C or G or T; "n" indicates no consensus at that position 

<223> Description of Artificial Sequence: Consensus 
Sequence of SEQ ID NOs : 12, 16, and 18 



<400> 20 

atggccccct ccntcgactc gatcgcgacc tcgntcgcca acggcntcnc naacggntng 60 
cacgccgnnc cgncnnngnc gncnacgggc gccacgtcca cnctcngncg gccgncgcng 120 
gctcgctcct cccgaccacc cagnngacgc agctcgacat cgtngagnag atcctcgccg 180 
accccaccgn nacgnacgnn ntcgaactcg acgggtacac cctcaccctc ggngacgtcg 240 
tcggcgccgc ncgcaagggc cgcncngtcc gcgtcncaga cagncgacga gatccgcgca 300 
aagatcgaca anagngtcga gttcctccgn ncncagctcn acaacagngt ctacggngtc 3 60 
acgactggtt tcggcggctc ggccgacacc cggactgagg atgcnatctc gctccagaag 420 
gcnctcctcg agcaccagct ctgcggtgtn ctcccnacgt cgntcgantc cttcngcctc 480 
ggncgcggcc tcgagaactc gcttccgctc gaggtcgtcc gcggcgccat gaccatccgc 54 0 
gtcaactcgc tcacncgcgg ccactcggcn gtccgcctcg tcgtcctcga ggcgctcacc 600 
aacttcctca accacggcat cacccccatc gtccccctcc gcggcaccat ctcggcgtcg 660 
ggcgacctct ccccnctctc ntacatcgcc gccgccatca ccggtcaccc ggacnncaag 720 
gtncacgtnn tncacgaggg canngagaag atcatgtncg cccgcgaggc gatcgcgctc 780 
ttnggtctcg agcccgtcgt cctcggcccg aaggagggtc tcggtctcgt caacggcacg 840 
gccgtctccg cctcgatggc gaccctcgct ctgcacgacg cacacatgct ctcgctcctc 900 
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tcgcaggcgc tcacggctct nacggtcgag gccatggt eg gccacgccgg ctcgttccac 960 
ccnttcctcc acgacgtcac gcgccctcac ccgacccaga tcgaggtcgc gcgcaacatc 1020 
cgcacgctcc tcgagggcag cnngtttgcc gtccaccacg aggaggaggt caaggtcaag 1080 
■ gacgacgagg gcattctccg ccaggaccgc tacccgctcc gcacgtcgcc tcagtggctc 1140 
ggcccgctcg tcagcgacat gattcacgcc cacgcngtcc tctcgctcga ggccggncag 1200 
tcgacgaccg acaacccgct catcgacgtc gagaacaaga ngacccacca cggcggcaac 1260 
ttccaggcgn ccgctgtcgc naacacgatg gagaagactc gcctcgcnct cgccctgatc 1320 
ggcaagctca acttcacgca gctcaccgag atgctcaacg ccggcatgaa ccgcggcctn 1380 
ccntcctgcc tcgctgccga ggacccntcg ctctcctatc actgcaaggg cctcgacatt 1440 
gccgcngcng cntacacttc ggagctcggn caccttgcca acccggtnac gacccacgtc 1500 
cagccggcng agatgggcaa ccaggccgtc aactcgctcg cnctcatctc ggcncgccgc 1560 
acngccgagg ccaacgacgt cctttctctc ctcctcgcca cccacctc-ca ctgcgtnctc 1620 
caggccgtcg acctccgcgc gatggagttc gagttcaaga agcagttcga cccgntnntc 1680 
ncnncgctcn tcnagcagca ctttggcncn gccctcgacg gcnnnnnnnn nnnnnacgaa 17 4 0 
ctcgnggaca aggtcaacaa gncgctcnac aagcgnctcg agcagaccaa ctcgtacgac 1800 
ctcgagccgc gctggcacga cgccttctcg ttcgcgaccg gcaccgtcgt cgagnnnnnn 1860 
nngtcctcgc cnnnngccan naggtctcgc tcgcngccgt caacgcctgg aaggtcgcct 1920 
ccgccgagaa ggccatctcg ctcacgcgcn angtccgcga cnccttctgg ncggcnccgt 1980 
cgtcgtcgtc gcccgcgctc ncgtacctct cgccgcgcac gcgcgtcctg tactcgttcg 2040 
tccgcgagga gctcggcgtc aaggcccgcc gcggcgacgt cttcctcggc aagcaggagg 2100 
tgacgatcgg caccaacgtc tcccgcatct acgaggccat caagnncggc ngcatcaacc 2160 
acgtcctcgt caagatgctc gcntagnnnc ncnnncnann ctcgcntnnn nnccnnncnc 2220 
nnnccnnnnn nnnctnttng nnntcgnntc ntgncnnnnn cgganntnnc nncnnnnnnn 2280 
tnnnncntnn ctnnctcncn nnnancnngt cnntnnnnnc tnnngnntnn nnnnncnnnc 2340 
ngtnnncann nacncntnnn nnnanncngg nanngantnn angnntncgn gnncnnnnnn 24 00 
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nnnanaaann nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnannnnnnn nnnnnnnnnn 24 60 

2475 



nnnnnnnnnn 

<210> 21 
<211> 726 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> SITE 

<222> (12, 16, 17, 20, 24, 25, 28 - 36, 38, 42, 47, 48, 56, 57, 62, 66 
67, 88, 92, 99, 110, 113, 354, 158, 243, 247, 252, 347, 413, 426 
=56, 558 - 561, 564, 569 - 571, 5^3 - 578, 581, 582, 585, 587, 
589, 610, 618, 622 - 626, 639, 649, 652, 653, 656, 666, 719) 

<223> "Xaa" means any amino acid; "Xaa" means no consensus at that 

position 

<223> Description of Artificial Sequence: Consensus of 
SEQ ID NOs: 13, 17, and 19 

<400> 21 

Met Ala Pro Ser Leu Asp Ser He Ala Thr Ser Xaa Ala Asn Gly Xaa 
15 10 15 



Xaa 



Asn Gly Xaa His Ala Ala Xaa Xaa Ala Ser Xaa Xaa Xaa Xaa Xaa 



Xaa Xaa Xaa Xaa Ala Xaa Ala Gly Ser Xaa Leu Pro Thr Thr Xaa Xaa 
35 40 45 

Thr Gin Leu Asp He Val Glu Xaa Xaa Leu Ala Asp Pro Xaa Thr Asp 
50 55 60 

Asp Xaa Xaa Glu Leu Asp Gly Tyr Ser Leu Thr Leu Gly Asp Val Val 
65 70 75 80 

Gly Ala Ala Arg Lys Gly Arg Xaa Val Arg Val Xaa Asp Ser Asp Glu 



He Arg Xaa Lys He Asp Lys Ser Val Glu Phe Leu Arg Xaa Gin Leu 
100 105 110 

Xaa Asn Ser Val Tyr Gly Val Thr Thr Gly Phe Gly Gly Ser Ala Asp 
115 120 125 

Thr Arg Thr Glu Asp Ala He Ser Leu Gin Lys Ala Leu Leu Glu His 
130 135 140 
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Gin Leu Cys Gly Val Leu Pro Thr Ser Xaa Asp Ser Phe Xaa Leu Gly 
145 150 155 160 

Arg Gly Leu Glu Asn Ser Leu Pro Leu Glu Val Val Arg Gly Ala Met 
165 170 175 

Thr lie Arg Val Asn Ser Leu Thr Arg Gly His Ser Ala Val Arg Leu 
180 185 190 

Val Val Leu Glu Ala Leu Thr Asn Phe Leu Asn His Gly He Thr Pro 
195 200 205 

He Val Pro Leu Arg Gly Thr He Ser Ala Ser Gly Asp Leu Ser Pro 
210 215 220 

Leu Ser Tyr He Ala Ala Ala He Thr Gly His Pro Asp Ser Lys Val 
225 230 235 240 

His Val Xaa His Glu Gly Xaa Glu Lys He Met Xaa Ala Arg Glu Ala 
245 250 255 

He Ala Leu Phe Gly Leu Glu Pro Val Val Leu Gly Pro Lys Glu Gly 
260 265 270 

Leu Gly Leu Val Asn Gly Thr Ala Val Ser Ala Ser Met Ala Thr Leu 
275 280 285 

Ala Leu His Asp Ala His Met Leu Ser Leu Leu Ser Gin Ala Leu Thr 
290 295 300 

Ala Leu Thr Val Glu Ala Met Val Gly His Ala Gly Ser Phe His Pro 
305 310 315 320 

Phe Leu His Asp Val Thr Arg Pro His Pro Thr Gin He Glu Val Ala 
325 330 335 

Arg Asn He Arg Thr Leu Leu Glu Gly Ser Xaa Phe Ala Val His His 
340 345 350 

Glu Glu Glu Val Lys Val Lys Asp Asp Glu Gly He Leu Arg Gin Asp 
355 360 3S5 

Arg Tyr Pro Leu Arg Thr Ser Pro Gin Trp Leu Gly Pro Leu Val Ser 
370 375 380 

Asp Met He His Ala His Ala Val Leu Ser Leu Glu Ala Gly Gin Ser 
385 390 395 400 
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Thr Thr Asp Asn Pro Leu lie Asp Val Glu Asn Lys Xaa Thr His His 
405 410 415 



Gly Gly Asn Phe Gin Ala Ser Ala Val Xaa Asn Thr Met Glu Lys Thr 
420 425 430 

Arg Leu Ala Leu Ala Leu He Gly Lys Leu Asn Phe Thr Gin Leu Thr 
435 440 445 

Glu Met Leu Asn Ala Gly Met Asn Arg Gly Leu Pro Ser Cys Leu Ala 
450 455 460 

Ala Glu Asp Pro Ser Leu Ser Tyr His Cys Lys Gly Leu Asp He Ala 
465 470 475 480 

Ala Ala Ala Tyr Thr Ser Glu Leu Gly His Leu Ala Asn Pro Val Thr 
485 490 495 

Thr His Val Gin Pro Ala Glu Met Gly Asn Gin Ala Val Asn Ser Leu 
500 505 510 

Ala Leu He Ser Ala Arg Arg Thr Ala Glu Ala Asn Asp Val Leu Ser 
515 520 525 

Leu Leu Leu Ala Thr His Leu Tyr Cys Val Leu Gin Ala Val Asp Leu 
530 535 540 

Arg Ala Met Glu Phe Glu Phe Lys Lys Gin Phe Xaa Pro Xaa Xaa Xaa 
545 550 555 560 

Xaa Leu Leu Xaa Gin His Phe Gly Xaa Xaa Xaa Thr Xaa Xaa Xaa Xaa 
565 570 575 

Xaa Xaa Glu Leu Xaa Xaa Lys Val Xaa Lys Xaa Leu Xaa Lys Arg Leu 
580 585 590 

Glu Gin Thr Asn Ser Tyr Asp Leu Glu Pro Arg Trp His Asp Ala Phe 
595 600 605 

Ser Xaa Ala Thr Gly Thr Val Val Glu Xaa Leu Ser Ser Xaa Xaa Xaa 
610 615 620 

Xaa Xaa Val Ser Leu Ala Ala Val Asn Ala Trp Lys Val Ala Xaa Ala 
625 630 635 640 

Glu Lys Ala He Ser Leu Thr Arg Xaa Val Arg Xaa Xaa Phe Trp Xaa 
645 650 655 
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Ala Pro Ser Ser Ser Ser Pro Ala Leu Xaa Tyr Leu Ser Pro Arg Thr 
660 665 670 



Arg Val Leu Tyr Ser Phe Val Arg Glu Glu Leu Gly Val Lys Ala Arg 
675 680 685 

Arg Gly Asp Val Phe Leu Gly Lys Gin Glu Val Thr He Gly Ser Asn 
690 695 700 

Val Ser Arg lie Tyr Glu Ala lie Lys Ser Gly Arg lie Asn Xaa Val 
705 710 715 720 

Leu Val Lys Met Leu Ala 
725 

<210> 22 
<211> 57 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
Primer OLI 7 4 

<400> 22 

atccatatgg ctccttcttt ggattctctt gctactaogc tcgccaacgg ctttacc 

<210> 23 

<211> 60 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
Primer OLI 75 

' <400> 23 

atcgcggccg catgcggatc ctcattacgc gagcatcttg acgaggacgt ggttgatgcq 
<210> 24 
<211> 37 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
Primer OLI 105 

<400> 24 

agtgaattca tggccccttc cttggactcg ctcgcca 
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<210> 25 
<211> 48 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
Primer OLI 8 0 

<400> 25 

atcgcatgct cattacgcga gcatcttgac gaggacgtgg ttgatgcg 

<210> 26 

<211> 39 

<212> DNA 

<213> Artificial 

<220> 

<223> Description of Artificial Sequence: oil 89 
<400> 26 

taaaagatct ccaccatggc cccttccttg gactcgctc 
39 



<210> 27 

<211> 36 

<212> DNA 

<213> Artificial 

<220> 

<223> Description of Artificial Sequence: oil 90 

<400> 27 

caagcggccg cagagacgcg ggatacgaaa gaactg 
36 



<210> 28 

<211> 2741 

<212> DNA 

<213> Rhodotorula graminis 



-40- 



<220> 

<221> modif ied_base 
<222> (6) 

<223> Description of modif ied_base : m = a or 
<220> 

<221> exon 

<222> (2008) . . (2586) 

<223> 



<22G> 

<221> exon 

<222> (1822) . . (1947) 

<223> 

<220> 

<221> exon 

<222> (] 587) . . (1748) 

<223> 

<220> 

<221> exon 

<222> (1365) . . (1529) 

<223> 

<220> 

<221> exon 

<222> (961) . . (1295) 

<223> 

<220> 

<221> exon 

<222> (449) . . (880) 

<223> 

<220> 

<221> exon 
<222> (1) . . (361) 
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<223> 



<220> 

<221> terminator 
<222> (2587) .. (2589) 
<223> 

<220> 

<221> Intron 

<222> (1948) . . (2007) 

<223> 

<220> 

<221> Intron 

<222> (1749) . . (1821) 

<223> 

<220> 

<221> Intron 

<222> (1530) . . (1586) 

<223> 

<220> 

<221> Intron 

<222> (1296) . . (1364) 

<223> 

<220> 

<221> Intron 
<222> (881) . . (960) 
<223> 

<220> 

<221> Intron 
<222> (362) .. (448) 
<223> 



<400> 28 



atg gem cct tec ttg gac teg cte gee acc aeg etc gcc aac ggc ttt 
Met Ala Pro Ser Leu Asp Ser Leu Ala Thr Thr Leu Ala Asn Gly Phe 



acc aac ggc teg eac gcc get ccg acc aag teg get gcg ggc ccc act 
Thr Asn Gly Ser His Ala Ala Pro Thr Lys Ser Ala Ala Gly Pro Thr 
20 25 30 

teg get etc cgc cgc acg ccc ggc etc gat ggc eac gcc gcg eac cag 
Ser Ala Leu Arg Arg Thr Pro Gly Leu Asp Gly His Ala Ala His Gin 
35 40 45 

teg cag etc gag ate gtg cag gag cte cte age gac ccc acc gac gac 
Ser Gin Leu Glu lie Val Gin Glu Leu Leu Ser Asp Pro Thr Asp Asp 



gtc gte gag etc age ggg tac age etc ace gtc egt gac gtt gte ggc 240 
Val Val Glu Leu Ser Gly Tyr Ser Leu Thr Val Arg Asp Val Val Gly 
65 70 75 80 

gcc gcc cgc aag ggg cgc agg gte cgc gtc cag aac gac gac gag ate 288 
Ala Ala Arg Lys Gly Arg Arg Val Arg Val Gin Asn Asp Asp Glu lie 
85 90 95 

cgc gca cgc gtc gac aag age gtc gac ttc etc aag gcc cag ctt cag 336 
Arg Ala Arg Val Asp Lys Ser Val Asp Phe Leu Lys Ala Gin Leu Gin 
100 105 110 

aac teg gte tac gga gtc acc acg g tgcgttccga gacgagaggc 381 
Asn Ser Val Tyr Gly Val Thr Thr 
115 120 

ggaaatcteg ggatgccgca gcgetgaacg ctgacactcg cttggaeggc tgccgeggtc 441 

ttgcagg gt ttc ggt ggc teg gcc gac acg agg act gag gat gca gtc 489 
Gly Phe Gly Gly Ser Ala Asp Thr Arg Thr Glu Asp Ala Val 
125 130 
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age etc cag aag gcg etc ate gag cac eag etc tgc ggc gtg acg eeg 
Ser Leu Gin Lys Ala Leu He Glu His Gin Leu Cys Giy Val Thr Pro 
135 140 145 150 

acg tec gtc teg tec tte age gtc gga cgc ggc etc gag aae acg ett 
Thr Ser Val Ser Ser Phe Ser Val Gly Arg Gly Leu Glu Asn Thr Leu 
155 160 165 

eeg etc gag gtc gtc cgc ggc gee atg gte ate cgc gtc aae teg etc 
Pro Leu Glu Val Val Arg Gly Ala Met Val He Arg Val Asn Ser Leu 
170 . 175 180 

acg egt ggc cae teg gee gte cge etc gtc gtc ett gag gcg etc ace 
Thr Arg Gly His Ser Ala Val Arg Leu Val Val Leu Glu Ala Leu Thr 
185 190 195 

aae tte ttg aae eac cge ate acg ecc ate gte cce etc cgc ggc tec 
■ Asn Phe Leu Asn His Arg He Thr Pro He Val Pro Leu Arg Gly Ser 
200 205 210 

ate teg gcg teg ggc gac etc age eeg etc teg tac ate gee ggc gee 
He Ser Ala Ser Gly Asp Leu Ser Pro Leu Ser Tyr He Ala Gly Ala 
215 220 225 230 

ate acc ggt cac ccc gac gtc aag gtt cac gtt ttg cac gag gga ace 
He Thr Gly His Pro Asp Val Lys Val His Val Leu His Glu Gly Thr 
235 240 245 

gag aag ate atg ttt gcg cgc gag gee ate teg cte ttt ggt etc gag 
Glu Lys He Met Phe Ala Arg Glu Ala He Ser Leu Phe Gly Leu Glu 
250 255 260 

gca gtc g gtaegtegeg agteetgact gcagtgagct; gttcgagagt ctcccagttt 
Ala Val 



gctgactgee ctttgttcat gcgattgeag te etc ggc ccg aag gag ggt cte 
Val Leu Gly Pro Lys Glu Gly Leu 
270 
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ggt ctg gtc aac gga acg gcc gtc tec gcc teg atg gcg acc etc agt 

Gly Leu Val Asn Gly Thr Ala Val Ser Ala Ser Met Ala Thr Leu Ser 
275 280 285 

ctg cac gac teg cac atg etc teg etc etc teg cag gcc ttg acg get 

Leu His Asp Ser His Met Leu Ser Leu Leu Ser Gin Ala Leu Thr Ala 
290 295 300 



etc acg gtg gag gcc atg gtc ggc cag cag ggc teg ttic gcg ecg ttc 
Leu Thr Val Glu Ala Met Val Gly Gin Gin Gly Ser Phe Ala Pro Phe 
305 310 315 320 



ate cac gae gtc tgc cge ccg cac cce 
lie His Asp Val Cys Arg Pro His Pro 
325 



ggc cag gtc gag gtc gcg cgc 1175 
Gly Gin Val Glu Val Ala Arg 
330 335 



aac ate ego acg etc ett tec ggc teg teg ttt gcc gtt gag cac gag 1223 
Asn lie Arg Thr Leu Leu Ser Gly Ser Ser Phe Ala Val Glu His Glu 
340 345 350 

gag gag gtc aag gtc aag gac gac gag ggc att ctt cgc cag gac cgc 1271 
Glu Glu Val Lys Val Lys Asp Asp Glu Gly lie Leu Arg Gin Asp Arg 
355 360 365 

tac ccg etc cgc acg teg cot cag gttcgtcccc tctctctccc cttccctceg 1325 
Tyr Pro Leu Arg Thr Ser Pro Gin 
370 375 

tccgaccggc gcgtcgagac ttacgttttg cgtatccag ttc etc ggc ccg etc 1379 

Phe Leu Gly Pro Leu 
380 



gtg gag gac atg atg cac gcc tac teg act etc teg etc gag aac aac 1427 
Val Glu Asp Met Met His Ala Tyr Ser Thr Leu Ser Leu Glu Asn Asn 
385 390 395 



acg acg acc gac aac ccg etc etc gac gtc gag aac aag cag acc gcg 1475 
Thr Thr Thr Asp Asn Pro Leu Leu Asp Val Glu Asn Lys Gin Thr Ala 
400 405 410 
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cac ggc ggc aac ttc cag gcg teg get gtc teg att teg atg gag aag 
His Gly Gly Asn Phe Gin Ala Ser Ala Val Ser lie Ser Met Glu Lys 
415 420 425 



1523 



ace agg tgegtetctc getgccttcg tactecgatc ttgtgctgaa tgttettctc 1579 

Thr Arg 

430 

ctgcagg etc gca etc gee etc ate ggc aag etc aac ttc aeg cag tge 1528 
Leu Ala Leu Ala Leu lie Gly Lys Leu Asn Phe Thr Gin Cys 
435 440 445 

acc gag ttg etc aac get gcc atg aac cgc ggc etg cct teg tge etc 167 6 
Thr Glu Leu Leu Asn Ala Ala Met Asn Arg Gly Leu Pro Ser Cys Leu 
450 455 460 

get gee gag gac ccg teg etc aac tat cac ggc aag ggc ttg gac att 1724 
Ala Ala Glu Asp Pro Ser Leu Asn Tyr His Gly Lys Gly Leu Asp lie 
465 470 475 

cac ate get get tac get teg gag gtgagccgte gacgttetcc gecgtegcte 1778 
His lie Ala Ala Tyr Ala Ser Glu 
480 485 

gtccccttca gegcacccag gctgacttcc tttccctctg tag etc ggc cac ett 1833 

Leu Gly His Leu 



gcc aac ccg gtc act acc ttc gtc cag ecc gea gag atg ggc aac cag 

Ala Asn Pro Val Thr Thr Phe Val Gin Pro Ala Glu Met Gly Asn Gin 

490 495 500 505 

gcc gtc aac teg etc get etc ate tec gcg cgc cgc act gcc gag gcc 

Ala Val Asn Ser Leu Ala Leu lie Ser Ala Arg Arg Thr Ala Glu Ala 

510 515 520 

aac gae gte ett tet etc gtgcgttcgt gtcgcaatga gtcccgacgc 
Asn Asp Val Leu Ser Leu 
525 
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aatagcgact gactgcgcga tcctgagcag ctt etc gcc teg cac ctg tac tgc 2031 
Leu Leu Ala Ser His Leu Tyr Cys 
530 535 



acg etc cag gcc gtc gac etc cgc gcg atg gag etc gac ttc aag aag 
Thr Leu Gin Ala Val Asp Leu Arg Ala Met Glu Leu Asp Phe Lys Lys 
540 545 550 

cag ttc gac ccg ctt etc ccg act etc etc cag eag cac etc ggc act 
■ Gin Phe Asp Pro Leu Leu Pro Thr Leu Leu Gin Gin His Leu Gly Thr 
555 560 565 

ggc etc gac gtc aac gea ctt gcg etc gag gtc aag aag gcg etc aac 
. Gly Leu Asp Val Asn Ala Leu Ala Leu Glu Val Lys Lys Ala Leu Asn 
570 575 580 

aag cgt etc gag cag acg acg acg tac gac etc gag ccg cgc tgg cac 
Lys Arg Leu Glu Gin Thr Thr Thr Tyr Asp Leu Glu Pro Arg Trp His 
585 590 595 

gac gcc ttc teg tac gcg acc ggc acc gtc gtc gag etc etc teg tec 
Asp Ala Phe Ser Tyr Ala Thr Gly Thr Val Val Glu Leu Leu Ser Ser 
600 605 610 615 

teg ccc tct gcc aac gtc acc ctt act gcc gtc aac gcg tgg aag gtt 
Ser Pro Ser Ala Asn Val Thr Leu Thr Ala Val Asn Ala Trp Lys Val 
620 625 630 

gcc teg gcc gag aag gcc ate teg etc acg cgc gag gtg cgc aac cgc 
Ala Ser Ala Glu Lys Ala lie Ser Leu Thr Arg Glu Val Arg Asn Arg 
635 640 645 

ttc tgg cag acg ccg tct teg cag gcg ccg gcg cac gea tac etc teg 
Phe Trp Gin Thr Pro Ser Ser Gin Ala Pro Ala His Ala Tyr Leu Ser 
650 655 660 

ccg cgc acg cgc gtc ctg tac teg ttc gtg cgc gag gag etc ggc gtg 
Pro Arg Thr Arg Val Leu Tyr Ser Phe Val Arg Glu Glu Leu Gly Val 
665 670 675 
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cag gcg cgc cgc ggc gac gtg ttt gtc ggc gtg cag cag gag acg ate 2511 
Gin Ala Arg Arg Gly Asp Val Phe Val Gly Val Gin Gin Glu Thr He 
680 685 690 695 

ggg age aac gtc teg cgc ate tac gag gee ate aag gac ggc cgc ate 2559 
Gly Ser Asn Val Ser Arg He Tyr Glu Ala He Lys Asp Gly Arg He 
700 705 710 

aac cac gtc etc gtc aag atg etc gcg taaggceega geaageetcg 2606 
Asn His Val Leu Val Lys Met Leu Ala 
715 720 

ectagacgec egcctcacec caagaceagc ttttcgacgt cgtgtcgtgc caagaacgga 2666 

etttcctcca tacaeatgtc gtcttaetct ctcgcegtca tcacgtctct eagttcttte 2726 

gtatcccgeg tetet 2741 



<210> 29 

<211> 2163 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Preferred 

theoretical sequence based in part on SEQ ID NO: 20 

<220> 

<221> CDS 

<222> (1) . . (2163) 

<223> 

<400> 29 

atg gee ecc tec btc gac teg ate gcg ace teg btc gee aac ggc btc 4 8 

Met Ala Pro Ser Xaa Asp Ser He Ala Thr Ser Xaa Ala Asn Gly Xaa 
15 10 15 

dcv aac ggv thg cac gee get ccg hce aag ycg sew acg ggc gee acg 96 
Xaa Asn Xaa Xaa His Ala Ala Pro Xaa Lys Xaa Xaa Thr Gly Ala Thr 
20 25 30 

tec ach etc mgiti egg ccg bcc dgg etc get cct ecc gee acc cag vhg 144 
Ser Xaa Leu Xaa Arg Pro Xaa Xaa Leu Ala Pro Pro Ala Thr Gin Xaa 



acg cag etc gac ate gtb gag vag ate etc gcc gae ecc ace gac gac 
Thr Gin Leu Asp lie Xaa Glu Xaa He Leu Ala Asp Pro Thr Asp Asp 
50 55 60 

gws vtc gaa etc gac ggg tac acc etc acc etc ggh gac gtc gtc ggc 
Xaa Xaa Glu Leu Asp Gly Tyr Thr Leu Thr Leu Xaa Asp Val Val Gly 
65 70 75 80 

gcc gcb cgc aag ggc cgc hcb gtc cgc gtc cag aca gmc gac gag ate 
Ala Ala Arg Lys Gly Arg Xaa Val Arg Val Gin Thr Xaa Asp Glu He 
85 90 95 

cgc gca aag ate gac aav agb gtc gag ttc etc cgb deb cag etc bac 
Arg Ala Lys lie Asp Xaa Xaa Val Glu Phe Leu Arg Xaa Gin Leu Xaa 
100 105 110 

aac agb gtc tac ggh gtc acg act ggt ttc ggc ggc teg gee gac acc 
Asn Xaa Val Tyr Xaa Val Thr Thr Gly Phe Gly Gly Ser Ala Asp Thr 
115 120 125 

egg act gag gat gov ate teg etc cag aag gcb etc etc gag cac cag 
Arg Thr Glu Asp Ala He Ser Leu Gin Lys Ala Leu Leu Glu His Gin 
130 135 140 

etc tgc ggt gtb etc ccb acg teg dtc gab tec ttc vgc etc ggh cgc 
Leu Cys Gly Xaa Leu Xaa Thr Ser Xaa Xaa Ser Phe Xaa Leu Xaa Arg 
145 150 155 160 

ggc etc gag aac teg ctt ccg etc gag gtc gtc cgc ggc gcc atg acc 
Gly Leu Glu Asn Ser Leu Pro Leu Glu Val Val Arg Gly Ala Met Thr 
165 170 175 

ate cgc gtc aac teg etc acb cgc ggc cac teg gcb gtc cgc etc gtc 
He Arg Val Asn Ser Leu Xaa Arg Gly His Ser Ala Val Arg Leu Val 
180 185 190 

gtc etc gag gcg etc acc aac ttc etc aac cac ggc ate acc ecc ate 
Val Leu Glu Ala Leu Thr Asn Phe Leu Asn His Gly He Thr Pro He 
195 200 205 

gtc ecc etc cgc ggc acc ate teg gcg teg ggc gac etc tec ccb etc 
Val Pro Leu Arg Gly Thr He Ser Ala Ser Gly Asp Leu Ser Xaa Leu 
210 215 220 

tcb tac ate gcc gcc gcc ate acc ggt cac ccg gac dbc aag gtb cac 
Xaa Tyr He Ala Ala Ala He Thr Gly His Pro Asp Xaa Lys Xaa His 
225 230 235 240 

gty kts cac gag ggc ams gag aag ate atg the gcc cgc gag gcg ate 
Xaa Xaa His Glu Gly Xaa Glu Lys He Met Xaa Ala Arg Glu Ala He 
245 250 255 

gcg etc ttb ggt etc gag ecc gtc gtc etc ggc ccg aag gag ggt etc 
Ala Leu Xaa Gly Leu Glu Pro Val Val Leu Gly Pro Lys Glu Gly Leu 
260 265 270 

ggt etc gtc aac ggc acg gcc gtc tec gcc teg atg gcg acc etc get 
Gly Leu Val Asn Gly Thr Ala Val Ser Ala Ser Met Ala Thr Leu Ala 
275 280 285 
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ctg cac gac gca cac atg etc teg etc etc teg eag gcg etc aeg get 
Leu His Asp Ala His Met Leu Ser Leu Leu Ser Gin Ala Leu Thr Ala 

290 295 300 

etb aeg gte gag gee atg gtc gge cae gee ggc teg ttc cae ccv ttc 
Xaa Thr Val Glu Ala Met Val Gly His Ala Gly Ser Phe His Xaa Phe 
305 310 315 320 

ete cae gac gte aeg egc eet cae ccg acc cag ate gag gtc gcg cgc 
Leu His Asp Val Thr Arg Pro His Pro Thr Gin lie Glu Val Ala Arg 
325 330 335 

aac ate cgc aeg etc etc gag ggc age hvg ttt gee gtc cac cae gag 
Asn lie Arg Thr Leu Leu Glu Gly Ser Xaa Phe Ala Val His His Glu 
340 345 350 

gag gag gtc aag gtc aag gac gac gag ggc att etc cgc cag gac egc 
Glu Glu Val Lys Val Lys Asp Asp Glu Gly lie Leu Arg Gin Asp Arg 
355 360 365 

tac ccg etc cgc aeg teg cct cag tgg etc ggc ecg etc gte age gae 
Tyr Pro Leu Arg Thr Ser Pro Gin Trp Leu Gly Pro Leu Val Ser Asp 
370 375 380 

atg att cac gee cac gcb gtc etc teg etc gag gcc gag teg aeg acc 
Met He His Ala His Ala Val Leu Ser Leu Glu Ala Glu Ser Thr Thr 
385 390 395 400 

gac aac ccg etc ate gac gtc gag aac aag ahg ace cac cac ggc ggc 
Asp Asn Pro Leu lie Asp Val Glu Asn Lys Xaa Thr His His Gly Gly 
405 410 415 

aac ttc cag gcg dec get gtc gov aac aeg atg gag aag act cgc etc 
Asn Phe Gin Ala Xaa Ala Val Ala Asn Thr Met Glu Lys Thr Arg Leu 
420 425 430 

, gev etc gee ctg ate ggc aag etc aac ttc aeg cag etc acc gag atg 
Ala Leu Ala Leu He Gly Lys Leu Asn Phe Thr Gin Leu Thr Glu Met 
435 440 445 

etc aac gcc ggc atg aac cgc ggc ctb ccb tec tgc etc get gcc gag 
Leu Asn Ala Gly Met Asn Arg Gly Xaa Xaa Ser Cys Leu Ala Ala Glu 
450 455 460 

gac ccb teg etc tec tat cae tgc aag ggc etc gae att gcc gcb gcb 
Asp Xaa Ser Leu Ser Tyr His Cys Lys Gly Leu Asp He Ala Ala Ala 
465 470 475 480 

gcb tac act teg gag etc ggh cac ctt gcc aac ccg gtb aeg acc cae 
Ala Tyr Thr Ser Glu Leu Xaa His Leu Ala Asn Pro Xaa Thr Thr His 
485 490 495 

gtc cag ccg geh gag atg ggc aac eag gee gtc aac teg etc gcb etc 
Val Gin Pro Arg Glu Met Gly Asn Gin Ala Val Asn Ser Leu Ala Leu 
500 505 510 

ate teg gcb cgc cgc acb gcc gag gcc aac gac gtc ctt tct etc etc 
He Ser Ala Arg Arg Xaa Ala Glu Ala Asn Asp Val Leu Ser Leu Leu 
515 " 520 525 
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etc gcc acc cac etc tac tgc gtb etc cag gcc gtc gac etc cqc geg 
Leu Ala Thr His Leu Tyr Cys Xaa Leu Gin Ala Val Asp Leu Arg Ala 
530 535 540 

atg gag ttc gag ttc aag aag cag ttc gac ccg vtb vtc vcb dcg etc 
Met Glu Phe Glu Phe Lys Lys Gin Phe Asp Pro Xaa Xaa Xaa Xaa Leu 
545 550 555 560 

htc vag cag cac ttt ggc dcy gcc etc gac ggc wac gaa etc ghg gac 
Xaa Xaa Gin His Phe Gly Xaa Ala Leu Asp Gly Xaa Glu Leu Xaa Asp 
565 570 575 

aag gtc aac aag deg etc dac aag egb etc gag cag acc aac teg tac 
Lys Val Asn Lys Xaa Leu Xaa Lys Arg Leu Glu Gin Thr Asn Ser Tyr 
580 585 590 

gac etc gag ccg cgc tgg cac gac gcc ttc teg ttc gcg acc ggc acc 
Asp Leu Glu Pro Arg Trp His Asp Ala Phe Ser Phe Ala Thr Gly Thr 
595 600 605 

gtc gtc gag etc etc teg tec teg ccb yet gcc aag gtc teg etc geb 
Val Val Glu Leu Leu Ser Ser Ser Xaa Xaa Ala Lys Val Ser Leu Ala 
610 615 620 

gcc gtc aac gcc tgg aag gtc gee tec gcc gag aag gee ate teg etc 
Ala Val Asn Ala Trp Lys Val Ala Ser Ala Glu Lys Ala lie Ser Leu 
625 630 635 640 

aeg cgc bav gtc cgc gac hec ttc tgg beg geb ccg teg teg teg teg 
Thr Arg Xaa Val Arg Asp Xaa Phe Trp Xaa Ala Pro Ser Ser Ser Ser 
645 650 655 

CCG gcg etc dcg tac etc teg ccg cgc acg cgc gtc ctg tac teg ttc 
Pro Ala Leu Xaa Tyr Leu Ser Pro Arg Thr Arg Val Leu Tyr Ser Phe 
660 665 670 

gtc cgc gag gag etc ggc gtc aag gcc cgc cgc ggc gac gtc ttc etc 
Val Arg Glu Glu Leu Gly Val Lys Ala Arg Arg Gly Asp Val Phe Leu 
675 680 685 

ggc aag cag gag gtg acg ate ggc ace aac gtc tec cgc ate tac gag 
Gly Lys Gin Glu Val Thr lie Gly Thr Asn Val Ser Arg He Tyr Glu 
690 695 700 

gcc ate aag dvc ggc hgc ate aac cac gtc etc gtc aag atg etc gcd 
Ala He Lys Xaa Gly Xaa He Asn Kis Val Leu Val Lys Met Leu Ala 
705 710 715 720 



<210> 30 

<211> 720 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Preferred 
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theoretical sequence based in part on SEQ ID NO: 20 



<220> 

<221> misc_f eature 

<222> (5) . . (5) 

<223> The 'Xaa' at location 5 stands for Val, Leu, or Phe . 
<220> 

<221> misc_feature 

<222> (12) . . (12) 

<223> The 'Xaa' at location 12 stands for Val, Leu, or Phe. 
<220> 

<221> misc_feature 

<222> (16) . . (16) 

<223> The 'Xaa' at location 16 stands for Val, Leu, or Phe. 
<220> 

<221> misc_feature 

<222> (17) . . (17) 

<223> The 'Xaa' at location 17 stands for Thr, Ala, or Ser. 
<220> 

<221> misc_f eature 

<222> (19) . . (19) 

<223> The 'Xaa'- at location 19 stands for Gly. 
<220> 

<221> misc_feature 

<222> (20) . . (20) 

<223> The 'Xaa' at location 20 stands for a stop codon, Ser, or Leu. 
<220> 

<221> inisc_f eature 

<222> (25) . . (25) 

, <223> The 'Xaa' at location 25 stands for Thr, Pro, or Ser. 

<220> 

<221> inisc_f eature 

. <222> (27).. (27) 

<223> The 'Xaa' at location 27 stands for Pro, or Ser. 

<220> 

<221> misc_f eature 
<222> (28) . . (28) 

<223> The 'Xaa' at locarion 28 stands for Ala, or Pro. 
<220> 

<221> misc_feature 
<222> (34) . . (34) 

<223> The 'Xaa' at location 34 stands for Thr. 
<220> 

<221> misc_feature 
<222> (36) . . (36) 

<223> The 'Xaa' at location 36 stands for Arg, or Ser. 

<220> 
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<221> misc_f eature 

<222> (39) . . (39) 

<223> The 'Xaa' at location 39 stands for Ala, Pro, or Ser . 
<220> 

<221> misc_feature 

<222> (40) . . (40) 

<223> The 'Xaa' at location 40 stands for Arg, Gly, or Trp. 
<220> 

<221> misc_f eature 

<222> (48) . . (48) 

<223> The 'Xaa' at location 48 stands for Lys , Thr, Met, Glu, Ala, 
Val, Gin, Pro, or Leu. 

<220> 

<221> misc_feature 

<222> (54) . . (54) 

<223> The 'Xaa' at location 54 stands for Val. 
<220> 

<221> misc_feature 

<222> (56) . . (56) 

<223> The 'Xaa' at location 56 stands for Lys, Glu, or Gin. 

<220> . 

<221> inisc_f eature 

<222> (65) . . (65) 

<223> The 'Xaa' at location 65 stands for Glu, Asp, or Val. 
<220> 

<221> misc_feature 

<222> (66) . . (66) 

<223> The 'Xaa' at location 66 stands for lie, Val, or Leu. 

<220> 

<221> mi sc_f eature 

<222> (76) . . (76) 

<223> The 'Xaa' at location 76 stands for Gly. 
<220> 

<221> 'nisc_feature 

<222> (87) . . (87) 

<223> The 'Xaa' at location 87 stands for Thr, Pro, or Ser. 
<220> 

<221> misc_f eature 

<222> (93) . . (93) 

<223> The 'Xaa' at location 93 stands for Asp, or Ala. 
<220> 

<221> inisc_feature 

<222> (102) . . (102) 

<223> The 'Xaa' at location 102 stands for Lys, or Asn. 
<220> 

<221> misc_feature 

<222> (103) . . (103) 

<223> The 'Xaa' at location 103 stands for Arg, or Ser. 
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<220> 

<221> misc_f eature 

<222> (109) . . (109) 

<223> The 'Xaa' at location 109 stands for Thr, Ala, or Ser. 
<220> 

<221> misc_f eature 

<222> (112) . . (112) 

<223> The 'Xaa' at location 112 stands for Asp, His, or Tyr . 
<220> 

<221> inisc_f eature 

<222> (114) . . (114) 

<223> The 'Xaa" at location 114 stands for Arg, or Ser. 
<220> 

<221> inisc_f eature 

<222> (117) . . (117) 

<223> The 'Xaa' at location 117 stands for Gly. 
<220> 

<221> inisc_f eature 

<222> (148) . . (148) 

<223> The 'Xaa' at location 148 stands for Val . 
<220> 

<221> misc_f eature 

<222> (150) . . (150) 

<223> The 'Xaa' at location 150 stands for Pro. 
<220> 

<221> misc_feature 

<222> (153) . . (153) 

<223> The 'Xaa' at location 153 stands for lie, Val, or Phe . 
<220> 

<221> misc_feature 

<222> (154) . . (154) 

<223> The 'Xaa' at location 154 stands for Glu, or Asp. 
<220> 

<221> misc_feature 

<222> (157) . . (157) 

<223> The 'Xaa' at location 157 stands for Ser, Gly, or Arg. 
<220> 

<221> misc_f eature 

<222> (159) . . (159) 

<223> The 'Xaa' at location 159 stands for Gly. 
<220> 

<221> misc_feature 

<222> (183) . . (183) 

<223> The 'Xaa' at location 183 stands for Thr. 



<220> 

<221> misc_feature 
<222> (223) . . (223) 
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<223> The 'Xaa' at location 223 stands for Pro. 
<220> 

<221> rtiisc_feature 

<222> (225) . . (225) 

<223> The 'Xaa' at location 225 stands for Ser. 
<220> 

<221> misc_feature 

<222> (237) . . (237) 

<223> The 'Xaa' at location 237 stands for Ser, Thr, lie, Gly, Ala, 
Val, 

Cys, or Phe . 

<220> 

<221> misc_feature 

<222> (239) . . (239) 

<223> The 'Xaa' at location 239 stands for Val. 
<220> 

<221> misc_f eature 

<222> (241) . . (241) 

<223> The 'Xaa' at location 241 stands for Val. 
<220> 

<221> misc_feature 

<222> (242) . . (242) 

<223> The 'Xaa' at location 242 stands for Val, Leu, or Phe. 
<220> 

<221> misc_feature 

<222> (246) . . (246) 

<223> The 'Xaa' at location 246 stands for Lys, Asn, or Thr. 

<220> 

<221> rrisc_f eature 

<222> (251) . . (251) 

<223> The 'Xaa' at location 251 stands for Tyr, Ser, or Phe. 
<220> 

<221> misc_feature 

<222> (259) . . (259) 

<223> The 'Xaa' at location 259 stands for Lea, or Phe. 
<220> 

<221> misc_feature 

<222> (305) . . (305) 

<223> The 'Xaa' at location 305 stands for Leu. 
<220> 

<221> misc_feature 

<222> (319) . . (319) 

<223> The 'Xaa' at location 319 stands for Pro. 
<220> 

<221> misc_f eature 

<222> (346) . . (346) 

<223> The 'Xaa' at location 34 6 stands for Lys, Arg, Thr, Gin, Pro, 
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stop codon, Trp, or Ser. 



<220> 

<221> inisc_feature 

<222> (411) . . (411) 

<223> The ' Xaa ' at location 411 stands for Lys, Thr, or Met. 



<220> 

<221> misc_feature 

<222> (421) . . (421) 

<223> The 'Xaa' at location 421 stands for Thr, Ala, or Ser. 
<220> 

<221> misc_feature 

<222> (457) . . (457) 

<223> The 'Xaa' at location 457 stands for Leu. 
<220> 

<221> misc_feature 

<222> (458) . . (458) 

<223> The 'Xaa' at location 458 stands for Pro. 
<220> 

<221> misc_f eature 

<222> (466) . . (466) 

<223> The 'Xaa' at location 4 66 stands for Pro. 
<220> 

<221> misc_feature 

<222> (487) . . (487) 

<223> The 'Xaa' at location 48^ stands for Gly. 
<220> 

<221> misc_feature 

<222> (493) . . (493) 

<223> The 'Xaa' at location 493 stands for Va.1 . 
<220> 

<221> mi sc_f eature 

<222> (518) . . (518) 

<223> The 'Xaa' at location 518 stands for Thr. 
<220> 

<221> misc_feature 

<222> (536) . . (536) 

<223> The 'Xaa' at location 536 stands for Val . 
<220> 

<221> misc_feature 
<222> (556) . . (556) 

<223> The 'Xaa' at location 556 stands for Met, lie, Val, or Leu. 
<220> 

<221> misc_featare 
<222> (557) . . (557) 

<223> The 'Xaa' at location 55"^ stands for lie, Val, or Leu. 

^ <220> 

<221> raisc_f eature 
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<222> (558) . . (558) 

<223> The ' Xaa ' at location 558 stands for Thr, Ala, or Pro. 



<220> 

<221> misc_feature 

<222> (559) . . (559) 

<223> The 'Xaa' at location 559 stands for Thr, Ala, or Ser. 
<220> 

<221> inisc_f eature 

<222> (561) . . (561) 

<223> The 'Xaa' at location 561 stands for lie. Leu, or Phe. 
<220> 

<221> misc_feature 

<222> (562) . . (562) 

<223> The 'Xaa' at location 562 stands for Lys, Glu, or Gin. 
<220> 

<221> misc_f eature 

<222> (567) . . (567) 

<223> The 'Xaa' at location 567 stands for Thr, Ala, or Ser. 
<220> 

<221> misc_feature 

<222> (572) . . (572) 

<223> The 'Xaa' at location 572 stands for Asn, or Tyr . 
<220> 

<221> mi sc_f eature 

<222> (575) . . (575) 

<223> The 'Xaa' at location 575 stands for Glu, Ala, or Val . 
<220> 

<221> raisc_f eature 

<222> (581).. 1581) 

<223> The 'Xaa' at location 581 stands for Thr, Ala, or Ser. 
<220> 

<221> misc_feature 

<222> (583) . . (583) 

<223> The 'Xaa' at location 583 stands for Asn, Asp, or Tyr. 
<220> 

<221> inisc_f eature 

<222> (617) .. (617) 

<223> The 'Xaa' at location 617 stands for Pro. 
<220> 

<221> misc_feature 

<222> (618).. (618) 

<223> The 'Xaa' at location 618 stands for Pro, or Ser. 
<220> 

<221> misc_f eature 

<222> (643).. (643) 

<223> The 'Xaa' at location 643 stands for Glu, Asp, Gin, His, a stop 
codon, or Tyr. 
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<220> 

<221> misc_f eature 

<222> (647 ) . . (647) 

<223> The 'Xaa' at location 647 stands for Thr, Pro, or Ser. 
<220> 

<221> misc_f eature 

<222> (650) . . (650) 

<223> The 'Xaa' at location 650 stands for Ala, Pro, or Ser. 
<220> 

<221> misc_feature 

<222> (660) .. (660) 

<223> The 'Xaa' at location 660 stands for Thr, Ala, or Ser. 
<220> 

<221> inisc_f eature 

<222> (708) . . (708) 

<223> The 'Xaa' at location 708 stands for Asn, Ser, Thr, Asp, Gly, 
Ala, Tyr, or Cys . 

<220> 

<221> misc_feature 

<222> (710) . . (710) 

<223> The 'Xaa' at location 710 stands for Ser, Arg, or Cys. 

<400> 30 

Met Ala Pro Ser Xaa Asp Ser lie Ala Thr Ser Xaa Ala Asn Gly Xaa 



Xaa Asn Xaa Xaa His Ala Ala Pro Xaa Lys Xaa Xaa Thr Gly Ala Thr 



Ser Xaa Leu Xaa Arg Pro Xaa Xaa Leu Ala Pro Pro Ala Tkr Gin Xaa 
35 4 0 4 5 

Thr Gin Leu Asp He Xaa Glu Xaa He Leu Ala Asp Pro Thr Asp Asp 



Xaa Xaa Glu Leu Asp Gly Tyr Thr Leu Thr Leu Xaa Asp Val Val Gly 
65 70 75 80 

Ala Ala Arg Lys Gly Arg Xaa Val Arg Val Gin Thr Xaa Asp Glu He 



Arg Ala Lys He Asp Xaa Xaa Val Glu Phe Leu Arg Xaa Gin Leu Xaa 
100 105 110 

Asn Xaa Val Tyr Xaa Val Thr Thr Gly Phe Gly Gly Ser Ala Asp Thr 
115 120 125 
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Arg Thr Glu Asp Ala lie Ser Leu Gin Lys Ala Leu Leu Glu His Gin 
130 135 140 



Leu Cys Gly Xaa Leu Xaa Thr Ser Xaa Xaa Ser Phe Xaa Leu Xaa Arg 
145 150 155 160 



Gly Leu Glu Asn Ser Leu Pro Leu Glu Val Val Arg Gly Ala Met Thr 
165 170 175 



lie Arg Val Asn Ser Leu Xaa Arg Gly His Ser Ala Val Arg Leu Val 
180 185 190 



Val Leu Glu Ala Leu Thr Asn Phe Leu Asn His Gly lie Thr Pro lie 
195 200 205 



Val Pro Leu Arg Gly Thr He Ser Ala Ser Gly Asp Leu Ser Xaa Leu 
210 215 220 



Xaa Tyr He Ala Ala Ala He Thr Gly His Pro Asp Xaa Lvs Xaa His 
225 ' 230 235 ' 240 



Xaa Xaa His Glu Gly Xaa Glu Lys He Met Xaa Ala Arg Glu Ala He 
245 250 255 



Ala Leu Xaa Gly Leu Glu Pro Val Val Leu Gly Pro Lys Glu Gly Leu 
260 265 270 



Gly Leu Val Asn Gly Thr Ala Val Ser Ala Ser Met Ala Thr Leu Ala 
275 280 285 



Leu His Asp Ala His Met Leu Ser Leu Leu Ser Gin Ala Leu Thr Ala 
290 295 300 



Xaa Thr Val Glu Ala Met Val Gly His Ala Gly Ser Phe His Xaa Phe 
305 310 315 320 



Leu His Asp Val Thr Arg Pro His Pro Thr Gin He Glu Val Ala Arg 
325 330 335 



Asn He Arg Thr Leu Leu Glu Gly Ser Xaa Phe Ala Val His His Glu 
340 345 350 



Glu Glu Val Lys Val Lys Asp Asp Glu Gly He Leu Arg Gin Asp Arg 
355 360 365 
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Tvr Pro Leu Arg Thr Ser Pro Gin Trp Leu Gly Pro Leu Val Ser Asp 
370 375 380 

Met He His Ala His Ala Val Leu Ser Leu Glu Ala Glu Ser Thr Thr 
385 390 395 400 

Asp Asn Pro Leu He Asp Val Glu Asn Lys Xaa Thr His His Gly Gly 
405 410 415 

Asn Phe Gin Ala Xaa Ala Val Ala Asn Thr Met Glu Lys Thr Arg Leu 
420 425 430 

Ala Leu Ala Leu He Gly Lys Leu Asn Phe Thr Gin Leu Thr Glu Met 
435 440 445 

Leu Asn Ala Gly Met Asn Arg Gly Xaa Xaa Ser Cys Leu Ala Ala Glu 
450 455 460 

Asp Xaa Ser Leu Ser Tyr His Cys Lys Gly Leu Asp Tie Ala Ala Ala 
465 470 475 450 

Ala Tyr Thr Ser Glu Leu Xaa His Leu Ala Asn Pro Xaa Thr Thr His 
485 490 495 

Val Gin Pro Arg Glu Met Gly Asn Gin Ala Val Asn Ser Leu Ala Leu 
500 505 510 

He Ser Ala Arg Arg Xaa Ala Glu Ala Asn Asp Val Leu Ser Leu Leu 
515 520 525 

Leu Ala Thr His Leu Tyr Cys Xaa Leu Gin Ala Val Asp Leu Arg Ala 
530 535 540 

Met Glu Phe Glu Phe Lys Lys Gin Phe Asp Pro Xaa Xaa Xaa Xaa Leu 
545 550 555 560 

Xaa Xaa Gin His Phe Gly Xaa Ala Leu Asp Gly Xaa Glu Leu Xaa Asp 
565 570 575 

Lys Val Asn Lys Xaa Leu Xaa Lys Arg Leu Glu Gin Thr Asn Ser Tyr 
580 585 590 

Asp Leu Glu Pro Arg Trp His Asp Ala Phe Ser Phe Ala Thr Gly Thr 
595 600 605 
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Val Val Glu Leu Leu Ser Ser Ser Xaa Xaa Ala Lys Val Ser Leu Ala 
610 615 620 



Ala Val Asn Ala Trp Lys Val Ala Ser Ala Glu Lys Ala lie Ser Leu 
625 630 635 640 



Thr Arg Xaa Val Arg Asp Xaa Phe Trp Xaa Ala Pro Ser Ser Ser Ser 
645 650 655 



Pro Ala Leu Xaa Tyr Leu Ser Pro Arg Thr Arg Val Leu Tyr Ser Phe 
660 665 670 



Val Arg Glu Glu Leu Gly Val Lys Ala Arg Arg Gly Asp Val Phe Leu 
675 680 685 



Gly Lys Gin Glu Val Thr lie Gly Thr Asn Val Ser Arg lie Tyr Glu 
690 695 700 



Ala lie Lys Xaa Gly Xaa lie Asn His Val Leu Val Lys Met Leu Ala 
705 710 715 720 
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